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(54) Titre : GENOME DE LISTERIA MONOCYTOGENES, POLYPEPTIDES ET UTILISATIONS 

(57) Abstract: The invention concerns the genome sequence and nucleotide sequences of Listeria monocytogenes EGD-e. The 
invention also concerns polypeptides of said organism, in particular surface or cell envelope polypeptides, or polypeptides involved 
in the different metabolism cycles, in particular vitamin B 12 biosynthesis. The invention further concerns uses of said sequences, and 
various tools enabling identification of L. monocytogenes or associated species. Finally, the invention concerns modified <I>L</I>. 
monocytogenes strains, and pharmaceutical or vaccine compositions using the inventive sequences. 

(57) Abreg6 : La presenle invention conceme la sequence g£nomique et des sequences nueleolidiques de Listeria monocytogenes 
EGD-c. U invention a egalement pour objet les polypeptides de cet organisme, en particulier les polypeptides de surface ou d'enve- 
loppc cellulaire, ou impliques dans les diffcrents cycles de mdtabolisme, en particulier la biosynthese de vitaminc B12. L'invention 
conceme aussi les utilisations des sequences d£crites, ainsi que differents outils permettant V identification de L. monocytogenes ou 
especes associees. L'invention conceme ausi des souches L. monocytogenes modifiees, ainsi que les compositions pharmaceutiques 
ou vaccinales utilisant les sequences de l'invention. 
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GENOME DE Listeria monocytogenes, POLYPEPTIDES ET UTILISATIONS 

L' invention a pour objet la sequence g&iomique et des sequences 
5 nucleotidiques codant pour des polypeptides de Listeria monocytogenes, tels que 
des polypeptides d'enveloppe cellulaire, secretes ou specifiques, ou impliques dans 
le metabolisme, dans le processus de replication ou dans la virulence, ainsi que des 
vecteurs incluant lesdites sequences et cellules ou animaux transformes par ces 
vecteurs. L' invention concerne egalement des procedes de detection de ces acides 

10 nucleiques ou polypeptides et des kits de diagnostic d'infection par Listeria 
monocytogenes, L'invention vise aussi une methode de selection de composes 
capables de moduler V infection bacterienne et un procede de biosynthese ou dfe 
biodegradation de molecules d'interet utilisant lesdites sequences nucleotidiques ou 
Iesdits polypeptides. L'invention comprend enfin des compositions 

15 pharmaceutiques, notamment vaccinates, pour la prevention et/ou le traitement 
defections bacteriennes, en particulier par Listeria monocytogenes. 

Listeria monocytogenes est un pathogene intracellular facultatif II s'agit 
de T agent etiologique de la listeriose, une infection liee a la nourriture posant des 
problemes de sante publique de plus en plus importants, avec un impact 

20 economique important pour Tindustrie alimentaire europeenne. La listeriose est 
Pinfection liee aux aliments la plus letale (mortalite d'environ 30 %). Listeria 
monocytogenes possede la propriete inhabituelle d'etre capable de traverser trois 
barrieres : La barriere intestinale, la barri^re hemato-encephalique et la barriere 
placentaire. Les manifestations cliniques de la listeriose incluent les meningites, 

25 meningo-encephalites, avortements et septicemics. Cette infection est opportuniste 
et affecte principalement les femmes enceintes, les bebes, les personnes ag^es et les 
personnes immuno-d^primees en particulier les personnes atteintes du SID A. Cette 
maladie affecte apparemment egalement les individus sains et est responsable d'un 
nombre important d'^pidemies en raison de produits alimentaires contamines. 

30 Listeria monocytogenes est egalement d'une importance veterinaire avec un risque 
principal pour les ovins (moutons) et les bovins. Listeria monocytogenes est 
particulterement resistante au stress ou aux conditions extremes et il est important 
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de rechercher sa presence avec soin non seulement pour des probtemes de securite 
alimentaire mais egalement pour des problemes de securit6 environnementale. 

La carte physique et une carte gen&ique preliminaire du genome de Listeria 
monocytogenes ont ete etablies pour la souche L028. Neanmoins, aucune carte 
5 genetique fine n'est disponible pour P instant. Le genome de cette bacterie est 
circulaire et comporte environ 3000 kilobases. Son contenu en GC est d'environ 38 
%. Les etudes des facteurs de virulence ont permis Hdentikcatiort d'un locus de 15 
kb qui peut etre considere comme etant un ilot de pathogenicity dans la mesure ou il 
contient la plupart des g&ies dont la fonction dans la virulence a 6t6 clairement 
10 identifiee. En plus de ce locus, quelques autres genes ont ete identifies en particulier 
les genes d' invasion et de motilite et des g6nes qui codent pour une hydrolase 
mureine, une superoxide dismutase, des facteurs sigma, etc... 

Une famille importante de proteines de Listeria monocytogenes est la 
famille des proteines de surface, Les processus d'e volution ont permis le 
15 d^veloppement d'un nombre de mecanismes uniques sur les bacteries. Gram+, par 
lesquels elles peuvent immobiliser des proteines a leur surface. Les fonctions de ces 
difFerentes proteines de parois cellulaires sont extremement diverses. Pourtant, 
beaucoup de proteines liees de fa?on covalente a la surface des pathogenes Gram+ 
sont estimees etre importantes pour la survie du pathogene a Tinterieur de Thote 
20 infecte. Pour Listeria monocytogenes, la capacite a pen6trer dans les cellules 
eukaryotes a ete liee i une famille de proteines de surface et/ou s6cretees, les 
internalines. Jusqu'ici, neuf membres de la famille des internalines ont ete identifies 
(InlA, InlB, InlC, InlC2, InlD, InlE, InlF, InlG, InlH). On pense qu'eiles sont 
ancrees dans la paroi cellulaire par un clivage proteolytique de la liaison T-G du 
motif LPxTG et que la liaison covalente du groupe carboxylique de la threonine h 
un groupe amino libre dans le peptidoglycane. Des etudes recentes ont montre qu'il 
y avait deux classes d' internalines ; un groupe de proteines assoctees k la paroi 
cellulaire de haut point moleculaire (> 50 kDa) et un groupe de plus petites 
proteines (< 30 kDa) qui sont secretees. Ces deux classes possedent cies motifs 
similaires LRR tres homologues, ainsi que les regions flanquantes LRR, et la 
sequence peptide-signal N-terminale. Les petites internalines (s-inl) ne possedent 
pas la region repetee B et la sequence d'ancrage dans la paroi cellulaire et sont ainsi 
secretees. 
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L' etude de Listeria monocytogenes demande de nouvelles approches, en 
particulier genetiques, afin d'ameliorer la comprehension .des . differentes voies 
metaboliques de cet organisme. 

Ainsi, c'est un objet de la presente invention que de divulguer la sequence 
5 complete du genome de Listeria monocytogenes EGD-e deposee & la CNCM le 1 1 
avril 2000 sous le numero 1-2440 ainsi que de tous les genes contenus dans cedit 
genome. 

En effet, la connaissance du genome de cet organisme permet de mieux 
definir les interactions entre les differents genes, les diff&rentes proteines, et par la- 
1 0 meme, les differentes voies metaboliques. En effet, et contrairement a la divulgation 
de sequences isolees, la sequence genomique complete d'un organisme forme un 
tout, permettant d'obtenir immediatement toutes les informations necessaires a cet 
organisme pour croitreetfonctionner. 

La presente invention concerne done une sequence nucleotidique de Listeria 
1 5 monocytogenes caracterisee en ce qu'elle correspond a la sequence SEQ ID N° 1. 

La presente invention concerne egalement une sequence nucleotidique de 
Listeria monocytogenes caracterisee en ce qu'elle est choisie parmi : 

a) une sequence nucleotidique comportant au moins 80 %, 85 %, 90 %, 95 
% ou 98 % d'identite avec SEQ ID N° 1 ; 
20 b) une sequence nucleotidique hybridant dans des conditions de forte 

stringence avec SEQ ID N° 1 ; 
c) une sequence nucleotidique compl6mentaire de SEQ ED 
N° 1 ou comptementaire d'une sequence nucleotidique telle que definie 
en a), ou b), ou une sequence nucleotidique de 1VARN correspondant ; 
25 d) une sequence nucleotidique de fragment representatif de SEQ ID N° I, 

ou de fragment representatif d'une sequence nucleotidique telle que 
definie en a), b) ou c); 
e) une sequence nucleotidique comprenant une sequence telle que definie 
en a), b), c) ou d) ; et 

30 f) une sequence nucleotidique modifiee d'une sequence nucleotidique telle 

que definie en a), b), c), d) ou e). 
De fa?on plus particuliere, la presente invention a egalement pour objet les 
sequences nucleotidiques caracterisees en ce qu'elles sont issues de SEQ ID N° 1 et 
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en ce qu'elles codent pour un polypeptide choisi parmi les polypeptides de sequence 
SEQ ID N° 2 a SEQ ID N° 2854, de preference codant pour un polypeptide 
d'enveloppe cellulaire ou present a la surface de Listeria monocytogenes de 
sequence SEQ ID N° 2 a SEQ ID N° 41 ou codant pour un polypeptide implique 
5 dans la biosynthese de la vitamine B12 de sequence SEQ ED N 0 ' 42 a SEQ ID N°64. 

La presente invention concerne aussi de fafon plus generate les sequences 
nucleotidiques issues de SEQ ID N° 1, et codant pour un polypeptide de L. 
monocytogenes, telles qu'elles peuvent etre isolees a partir de SEQ ED N° 1. 

De plus, les sequences nucleotidiques caracterisees en ce qu'elles 
1 0 comprennent une sequence nucleotidique choisie parmi : 

a) une sequence nucleotidique codant pour un polypeptide choisi parmi les 
sequences SEQ ID N° 2 a SEQ ID N°2854, de preference choisi parmi les 
polypeptides de sequence SEQ ID N° 2 k SEQ ED N° 41 ou SEQ ID N° 42 a 
SEQ ID N° 64; 

15 b) une sequence nucleotidique comportant au moins 80 %, 85 %, 90 %, 95 % ou 
98 % d'identite avec une sequence nucleotidique codant pour un polypeptide 
choisi parmi les sequences SEQ ED N° 2 a SEQ ED N° 2854, de preference 
choisi parmi les polypeptides de sequence SEQ ED N° 2 a SEQ ID N° 41 ou 
SEQ ID N° 42 a SEQ ID N° 64 ; 

20 c) une sequence nucleotidique s'hybridant dans des conditions de forte stringence 
avec une sequence nucleotidique codant pour un polypeptide choisi parmi les 
sequences SEQ ED N° 2 k SEQ ID N° 2854, de preference choisi parmi les 
polypeptides de sequence SEQ ED N° 2 a SEQ ED N° 41 ou SEQ ED N° 42 & 
SEQ ED N° 64 ; 

25 d) une sequence nucleotidique complementaire ou d'ARN correspondant k une 
sequence telle que definie en a), b) ou c) ; 

e) une sequence nucleotidique de fragment representatif d'une sequence telle que 
definie en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence telle que definie en a), b), 
30 c), d) ou e), 

sont egalement des objets de T invention. 

Par acide nucleique, sequence nucleique ou d'acide nucleique, 
polynucleotide, oligonucleotide, sequence de polynucleotide, sequence 
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nucleotidique, termes qui seront employes indifferemment dans la presente 
description, on entend designer un enchainement precis de nucleotides, modifies ou 
non, pennettant de definir un fragment ou une region d'un acide nucleique, 
comportant ou non des nucleotides non naturels, et pouvant correspondre aussi bien 
5 a un ADN double brin, un ADN simple brin que des produits de transcription 
desdits ADNs. Ainsi, les sequences nucleiques selon Tinvention englobent 
egalement les PNA (Peptid Nucleic Acid), ou analogues. 

II doit etre compris que la presente invention ne concerne pas les sequences 
nucleotidiques dans leur environnement chroraosomique naturel, c'est-&-dire a l'etat 

1 0 naturel. D s'agit de sequences qui ont ete isolees et/ou purifiees, c'est-a-dire qu'elles 
ont ete prelevees directement ou indirectement, par exemple par copie, leur 
environnement ayant ete au moins partiellement modifie. On entend ainsi egalement 
designer les acides nucleiques obtenus par synthese chimique. 

Par « pourcentage d'identite » entre deux sequences d'acides nucleiques ou 

15 d'acides amines au sens de la presente invention, on entend designer un 
pourcentage de nucleotides ou de residus d'acides amines identiques entre les deux 
sequences a comparer, obtenu apr6s le meilleur alignement, ce pourcentage etant 
purement statistique et les differences entre les deux sequences etant reparties au 
hasard et sur toute leur longueur. On entend designer par "meilleur alignement" ou 

20 "alignement optimal", Mignement pour lequel le pourcentage d'identite determine 
comme ci-apres est le plus eleve. Les comparaisons de sequences entre deux 
sequences d'acides nucleiques ou d'acides amines sont traditionnellement realisees 
en comparant ces sequences apres les avoir alignees de maniere optimale, ladite 
comparaison etant realisee par segment ou par « fenetre de comparaison » pour 

25 identifier et comparer les regions locales de similarite de sequence. L'alignement 
optimal des sequences pour la comparaison peut etre realise, outre manuellement, 
au moyen de I'algorithme d'homologie locale de Smith et Waterman (1981, Ad. 
App. Math. 2 : 482), au moyen de Talgorithme d'homologie locale deNeddleman et 
Wunsch (1970, J. Mol. Biol. 48 : 443), au moyen de la methode de recherche de 

30 similarite de Pearson et Lipman (1988, Proc. Natl, Acad. Sci. USA 85 : 2444), au 
moyen de logiciels informatiques utilisant ces algorithms (GAP, BESTFIT, 
BLAST P, BLAST N, FASTA et TFASTA dans le Wisconsin Genetics Software 
Package, Genetics Computer Group, 575 Science Dr., Madison, WI). Afih d'obtenir 
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Talignenient optimal, on utilise de preference le programme BLAST, avec la 
matrice BLOSUM 62. On peut egalement utiliser les matrices PAM ou PAM250. 

Le pourcentage d'identite entre deux sequences d'acides nucleiques ou 
d'acides amines est determine en comparant ces deux sequences alignees de 
5 maniere optimale dans laquelle la sequence d'acides nucleiques ou d'acides amines 
& comparer peut comprendre des additions ou des deletions par rapport a la 
sequence de reference pour un alignement optimal entre ces deux sequences. Le 
pourcentage d'identite est calcule en determinant le nombre de positions identiques 
pour lesquelles le nucleotide ou le residu d'acide amine est identique entre les deux 

10 sequences, en divisant ce nombre de positions identiques par le nombre total de 
positions comparees et en multipliant le resultat obtenu par 100 pour obtenir le 
pourcentage d'identite entre ces deux sequences. 

Par sequences nucleiques presentant un pourcentage d'identite d'au moins 
80 %, de preference 85 % ou 90 %, de fa9on plus preferee 95 ,% voire 98 %, apres 

15 alignement optimal avec une sequence de reference, on entend designer les 
sequences nucleiques presentant, par rapport a la sequence nucleique de reference, 
certaines modifications comme en particulier une deletion, une troncation, un 
allongement, une fusion chimerique et/ou une substitution, notamment ponctuelle, 
et dont la sequence nucleique presente au moins 80 %, de preference 85 %, 90 %, 

20 95 % ou 98 %, d'identite apres alignement optimal avec la sequence nucleique de 
reference. II s'agit de preference de sequences dont les sequences complementaires 
sont susceptibles de s'hybrider specifiquement avec les sequences de reference. De 
preference, les conditions d' hybridation specifiques ou de forte stringence seront 
telles qu'elles assurent au moins 80 %, de preference 85 %, 90 %, 95 % ou 98 % 

25 d'identite apres alignement optimal entre Tune des deux sequences et la sequence 
cbmplementaire de T autre. 

Une hybridation dans des conditions de forte stringence signifie que les 
conditions de temperature et de force ionique sont choisies de telle maniere qu'elles 
permettent le maintien de l'hybridation entre deux fragments d'ADN 

30 complementaires. A titre illustratif, des conditions de forte stringence de l'dtape 
d' hybridation aux fins de definir les fragments polynucleotidiques decrits ci-dessus, 
sont avantageusement les suivantes. 
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I/hybridation ADN-ADN ou ADN-ARN est reaiisee en deux etapes : (1) 
prehybridation a 42°C pendant 3 heures en tampon phosphate (20 mM, pH 7,5) 
contenant 5 x SSC (1 x SSC correspond a une solution 0,15 M NaCl + 0,015 M 
citrate de sodium), 50 % de formamide, 7 % de sodium dodecyl sulfate (SDS), 10 x 
5 Denhardt's, 5 % de dextran sulfate et 1 % d'ADN de sperme de saumon ; (2) 
hybridation proprement dite pendant 20 heures a une temperature dependant de la 
taille de la sonde (i.e. : 42°C, pour une sonde de taille > 100 nucleotides) suivie de 2 
lavages de 20 minutes a 20°C en 2 x SSC + 2 % SDS, 1 lavage de 20 minutes a 
20°C en 0,1 x SSC + 0,1 % SDS. Le dernier lavage est pratique en 0,1 x SSC + 

10 0,1 % SDS pendant 30 minutes a 60°C pour une sonde de taille > 100 nucleotides. 
Les conditions d'hybridation de forte stringence decrites ci-dessus pour un 
polynucleotide de taille definie, peuvent etre adaptees par l'homme du metier pour 
des oligonucleotides de taille plus grande ou plus petite, selon 1'enseignement de 
Sambrook et al., (1989, Molecular cloning : a laboratory manual. 2 nd Ed. Cold 

1 5 Spring Harbor). 

De plus, par fragment representatif de sequences selon Pinvention, on 
entend designer tout fragment nucleotidique presentant au moins 15 nucleotides, de 
preference au moins 20, 25, 30, 50, 75, 100, 150, 300 et 450 nucleotides consecutifs 
de la s6quence dont il est issu. • : 

20 Par fragment representatif, on entend en particulier une sequence nucleique 

codant pour un fragment biologiquement actif d'un polypeptide, tel que defini plus 
loin. 

Par fragment representatif, on entend egalement les sequences intergeniques, 
et en particulier les sequences nucleotidiques portant les signaux de regulation 
25 (promoteurs, terminateurs, voire enhancers . . . ). 

Parmi lesdits fragments representatifs, on pref&re ceux ayant des sequences 
nucleotidiques correspondant a des cadres ouverts de lecture, denommes sequences 
ORFs (ORF pour « Open Reading Frame »), compris en general entre un codon 
d' initiation et un codon stop, 6u entre deux codons stop, et codant pour des 
30 polypeptides, de preference d'au moins 100 acides amines, tel que par exemple, 
sans s'y limiter, les sequences ORFs qui seront decrites par la suite. 
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La numerotation des sequences nucleotidiques ORFs qui sera utilisee par la 
suite dans la presente description correspond a la numerotation des sequences 
d'acides amines des proteines codees par lesdites ORFs. 

Les fragments representatifs selon F invention peuvent etre obtenus par 
5 exemple par amplification spScifique telle que la PCR ou apres digestion par des 
enzymes de restriction approprtes de sequences nucleotidiques selon Finvention, 
cette methode etant decrite en particulier dans Fouvrage de Sambrook et al.. Lesdits 
fragments representatifs peuvent egalement etre obtenus par synthese chimique lors 
que leur taille n'est pas trop importante, selon des methodes bien connues de 
1 0 Fhomme du metier. 

Parmi les sequences contenant des sequences de T invention, ou des 
fragments representatifs, on entend egalement les sequences qui sont naturellement 
encadrees par des sequences qui presentent au moins 80 %, 85 %, 90 %, 95 % ou 98 
% d'identite avec les sequences selon Finvention. 
15 Par sequence nucleotidique modifiee, on entend toute sequence 

nucleotidique obtenue par mutagenese selon des techniques bien connues de 
Thomme du metier, et comportant des modifications, de preference au maximum 10 
% de nucleotides modifies, par rapport aux sequences normales, par exemple des 
mutations dans les sequences regulatrices et/ou promotrices de Texpression du 
20 polypeptide, notamment conduisant h une modification du taux d'expression ou de 
Tactivite dudit polypeptide. 

Par sequence nucleotidique modifiee, on entend Egalement toute sequence 
nucleotidique codant pour un polypeptide modifie tel que definit ci-apres. 

Les fragments representatifs selon F invention peuvent egalement etre des 
25 sondes ou amorces, qui peuvent etre utilisees dans des procedes de detection, 
d'identification, de dosage ou d'amplification de sequences nucleiques. 

Une sonde ou amorce se definit, au sens de Finvention, comme etant un 
fragment d'acides nucleiques simple brin ou un fragment double brin denature 
comprenant par exemple de 12 bases k quelques kb, notamment de 15 a quelques 
30 centaines de bases, de preference de 15 a 50 ou 100 bases, et possedant une 
specificite d'hybridation dans des conditions determinees pour former un complexe 
d'hybridation avec un acide nucleique cible. 
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Les sondes et amorces selon T invention peuvent etre marquees directement 
ou indirectement par un compose radioactif ou non radioactif par des methodes bien 
connues de l'homme du metier, afin d'obtenir un signal detectable et/ou 
quantifiable. 

5 Les sequences de polynucleotides selon T invention non marquees peuvent 

etre utilisees directement comme sonde ou amorce. 

Les sequences sont generalement marquees pour obtenir des sequences 
utilisables pour de nombreuses applications. Le marquage des amorces ou des 
sondes selon Tinvention est realise par des elements radioactifs ou par des 

1 0 molecules non radioactives. 

Parrai les isotopes radioactifs utilises, on peut citer le ?2 P, le 33 P, le 33 S, le 3 H 
ou le l25 I. Les entites non radioactives sont selectionnees parmi les ligands tels la 
biotine, I'avidine, la streptavidine, la dioxyg^nine, les haptenes, les colorants, les 
agents luminescents tels que les agents radioluminescents, chemoluminescents, 

1 5 bioluminescents, fluorescents, phosphorescents. 

Les polynucleotides selon Tinvention peuvent ainsi etre utilises comme 
amorce et/ou sonde dans des precedes mettant en oeuvre notamment la technique de 
PCR (amplification en chaine par polymerase) (Rolfs et al., 1991, Berlin : Springer- 
Verlag). Cette technique necessite le choix de paires d'amorces oligonucleotidiques 

20 encadrant le fragment qui doit etre amplifie. On peut, par exemple, se referer a la 
technique decrite dans le brevet am&icain U.S. N° 4,683,202. Les fragments 
amplifies peuvent etre identifies, par exemple apres une -efectrophor&se en gel 
d'agarose ou de polyacrylamide, ou apr&s une technique chromatographique comme 
la filtration sur gel ou la chromatographic echangeuse d'ions, puis sequences. La 

25 spScificite de Pamplification peut etre controlee en utilisant comme amorce les 
sequences nucleotidiques de polynucleotides de Tinvention comme matrice, des 
plasmides contenant ces sequences ou encore les produits d' amplification derives. 
Les fragments nucleotidiques amplifies peuvent 6tre utilises comme reactifs dans 
des reactions d' hybridation afin de mettre en evidence la presence, dans un 

30 echantillon biologique, d'un acide nucleique cible de sequence complementaire k 
celle desdits fragments nucleotidiques amplifies. 

L'invention vise ^galement les acides nucleiques susceptibles d'etre obtenus 
par amplification a Taide d'amorces selon Tinvention. 
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D'autres techniques d'araplification de Facide nucleique cible peuvent etre 
avantageusement employees comme alternative a la PCR (PCR-like) a Faide de 
couple d'amorces de sequences nucleotidiques selon Finvention. Par PCR-like on 
entend designer toutes les methodes mettant en oeuvre des reproductions directes ou 
5 indirectes des sequences d'acides nucleiques, ou bien dans lesquelles les systemes 
de marquage ont ete amplifies, ces techniques sont bien entendu connues, en 
general il s'agit de 1'amplification de FADN par une polymerase ; lorsque 
Fechantillon d'origine est un ARN il convient prealablement d'effectuer une 
transcription reverse. II existe actuellement de tres nombreux procedes permettant 

10 cette amplification, comme par exemple la technique SDA (Strand Displacement 
Amplification) ou technique d 'amplification a deplacement de brin (Walker et al., 
1992, Nucleic Acids Res. 20 : 1691), la technique TAS (Transcription-based 
Amplification System) decrite par Kwoh et al. (1989, Proc/NatL Acad. Sci. USA, 
86, 1173), la technique 3SR (Self-Sustained Sequence Replication) decrite par 

15 Guatelli et al. (1990, Proc. Natl. Acad. Sci. USA 87: 1874), la technique NASBA 
(Nucleic Acid Sequence Based Amplification) decrite par Kievitis et al. (1991, J. 
Virol. Methods, 35, 273), la technique TMA (Transcription Mediated 
Amplification), la technique LCR (Ligase Chain Reaction) decrite par Landegren et 
al. (1988, Science 241, 1077), la technique de RCR (Repair Chain Reaction) decrite 

20 par Segev (1992, Kesster C. Springer Verlag, Berlin, New-York, 197-205), la 
technique CPR (Cycling Probe Reaction) decrite par Duck et al. (1990, 
Biotechniques, 9, 142), la technique d'amplification a la Q-beta-replicase decrite 
par Miele et al. (1983, J. Mol. Biol., 171, 281). Certaines de ces techniques ont 
depuis ete perfectionnees. 

25 Dans le cas ou le polynucleotide cible a detecter est un ARNm, on utilise 

avantageusement, prealablement a la mise en oeuvre d'une reaction d'amplification 
k Faide des amorces selon Finvention ou a la mise en oeuvre d'un procede de 
detection a Faide des sondes de Finvention, une enzyme de type transcriptase 
inverse afin d'obtenir un ADNc a partir de FARNm contenu dans Fechantillon 

30 biologique. L'ADNc obtenu servira alors de cible pour les amorces ou les sondes 
mises en oeuvre dans le procedS d'amplification ou de detection selon Finvention. 

La technique d'hybridation de sondes peut etre realisee de manieres diverses 
(Matthews et al, 1988, Anal. Biochem., 169, 1-25). La methode la plus generale 
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consiste a immobiliser l'acide nucleique extrait des cellules de differents tissus ou 
de cellules en culture sur un support (tels que la nitrocellulose, le nylon, le 
polystyrene) et a incuber, dans des conditions bien definies, l'acide nucleique cible 
immobilise avec la sonde. Apres l'hybridation, l'exces de sonde est elimin6 et les 
5 molecules hybrides formees sont detectees par la methode appropriee (mesure de la 
radioactivite, de la fluorescence ou de 1'activite enzymatique liee a la sonde). 

Selon un autre mode de mise en oeuvre des sondes nucleiques selon 
Finvention, ces dernieres peuvent etre utilisees comme sondes de capture. Dans ce 
cas, une sonde, dite « sonde de capture », est immobilisee sur un support et sert k 
10 capturer par hybridation specifique l'acide nucleique cible obtenu a partir de 
I'echantilton biologique a tester et l'acide nucleique cible est ensuite detecte grace a 
une seconde sonde, dite « sonde de detection », marquee par un 6l6ment facilement 
detectable. 

Parmi les fragments d'acides nucleiques interessants, il faut ainsi citer en 

15 particulier les oligonucleotides anti-sens, c'est-a-dire dont la structure assure, par 
hybridation avec la sequence cible, une inhibition de l'expression du produit 
correspondant. II faut egalement citer les oligonucleotides sens qui, par interaction 
avec des proteines impliquees dans la regulation de l'expression du produit 
correspondant, induiront soit une inhibition, soit une activation de cette expression. 

20 De fa?on preferee, les sondes ou amorces selon P invention sont 

immobilisees sur un support, de maniere covalente ou non covalente. En particulier, 
le support peut etre une puce a ADN ou un filtre a haute densite, Egalement objets 
de la presente invention. 

On entend designer par puce a ADN ou filtre haute densite, un support sur 

25 lequel sont fix6es des sequences d'ADN, chacune d'entre elles pouvant etre reperee 
par sa localisation geographique. Ces puces ou filtres different principalement par 
leur taille, le materiau du support, et eventuellement le nombre de sequences 
d'ADN qui y sont fixees. 

On peut fixer les sondes ou amorces selon la premiere invention sur des 

30 supports solides, en particulier les puces a ADN, par differents precedes de 
fabrication. En particulier, on peut effectuer une synthese in situ par adressage 
photochimique ou par jet d'encre. D'autres techniques consistent a effectuer une 
synthese ex situ et a fixer les sondes sur le support de la puce a ADN par adressage 
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mecanique, electronique ou par jet d'encre. Ces differents precedes sont bien 
connus de 1'homme du metier. 

Une sequence nucleotidique (sonde ou amorce) selon Pinvention permet 
done la detection et/ou 1' amplification de sequences nucleiques specifiques. En 
5 particulier, la detection de cesdites sequences est facilitee lorsque la sonde est fix£e 
sur une puce a ADN, ou a un filtre haute densit6. 

L'utilisation de puces a ADN ou de filtres a haute density permet en effet de 
determiner l'expression de genes dans un organisme presentant une sequence 
genomique proche de L. monocytogenes EGD-e. 
10 La sequence genomique de L monocytogenes EGD-e, completee par 

T identification de tous les g£nes de cet organisme, telle que presentee dans la 
presente invention, sert de base a la construction de ces puces a ADN ou filtre. 

La preparation de ces filtres ou puces consiste a synthetiser des 
oligonucleotides, correspondant aux extremites 5' et 3' des g£nes. Ces 
15 oligonucleotides sont choisis en utilisant la sequence genomique et ses annotations 
divulguees par la presente invention. La temperature d'appariement des ces 
oligonucleotides aux places correspondantes sur FADN doit etre 
approximativement la meme pour chaque oligonucleotide. Ceci permet de preparer 
des fragments d'ADN correspondants k chaque g&ie par l'utilisation de condition 
20 de PCR appropriees dans un environnement hautement automatisee. Les fragments 
amplifies sont ensuite immobilises sur des filtres ou des supports en verre, silicium 
ou poly meres synthetiques et ces milieux sont utilises pour P hybridation. 

La disponibilite de tels filtres et/ou puces et de la sequence genomique 
correspondante annotee permet d'etudier Texpression de grands ensembles, voire de 
25 la totalite des genes dans les micro-organismes associes a Listeria monocytogenes, 
en preparant les ADN complementaires, et en les hybridant k l'ADN ou aux 
oligonucleotides immobilises sur les filtres ou les puces. Egalement, les filtres et/ou 
les puces permettent d f etudier la variabilite des souches ou des espSces, en preparant 
TADN de ces organismes et en les hybridant a TADN ou aux oligonucleotides 
30 immobilises sur les filtres ou les puces. 

Les differences entre les sequences genomiques des differentes souches ou 
esptees peuvent grandement affecter Tintensit6 de Thybridation et, par consequent, 
perturber Tinterpretation des resultats. D peut done etre necessaire d'avoir la 
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sequence precise des genes de la souche que Ton souhaite etudier. La methode de 
detection des genes decrite plus loin en detail, impliquant la determination de la 
sequence de fragments aleatoires d'un genome, et les organisant d'apres la sequence 
du genome complet de Listeria monocytogenes EGD-e divulgue dans la presente 
5 invention, peut etre tres utile. 

Les sequences nucleotidiques selon ^invention peuvent etre utilisees dans 
des puces a ADN pour efFectuer F analyse de mutations. Cette analyse repose sur la 
constitution de puces capables d'analyser chaque base d'une sequence nucleotidique 
selon Tinvention. On pourra notamment a cette fin mettre en oeuvre les techniques 

10 de micro-sequen9age sur puce a ADN. Les mutations sont detectees par extension 
d'amorces immobilisees hybridant a la matrice des sequences analysees, juste en 
position adjacente de celle du nucleotide mute recherche. Une matrice simple-brin, 
ARN ou ADN, des sequences a analyser sera avantageusement preparee selon des 
methodes classiques, k partir de produits amplifies selon les techniques de type 

15 PCR. Les matrices d'ADN simple brin, ou d'ARN ainsi obtenues sont alors 
deposees sur la puce a ADN, dans des conditions permettant leur hybridation 
specifique aux amorces immobilisees. Une polymerase thermostable, par exemple 
la Tth ou la Taq ADN polymerase, etend specifiquement l'extremite 3 ! de Tamorce 
immobilisee avec un analogue de nucleotide marque complementaire du nucleotide 

20 en position du site variable; par exemple un cycle thermique est realise en presence 
des dideoxyribonucleotides fluorescents. Les conditions expdrimentales seront 
adaptees notamment aux puces employees, aux amorces immobilisees, aux 
polymerases employees, et au systeme de marquage choisi. Un avantage du 
microsequen9age, par rapport aux techniques basees sur Thybridation de sondes, est 

25 qu'il permet d'identifier tous les nucleotides variables avec une discrimination 
optimale dans des conditions de reactions homogenes; utilise sur des puces a ADN, 
il permet une resolution et une specificite optimales pour la detection routiniere et 
industrielle de mutations en multiplex. 

L'utilisation des filtres a haute densite et/ou des puces permet ainsi d'obtenir 

30 des cormaissances nouvelles sur la regulation des genes dans les organismes 
d f importance industrielle, et en particulier les listeria propagees dans diverses 
conditions. Elle permet aussi une identification rapide des differences entre les 
genomes des souches utilisees dans de multiples applications industridlles. 
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En outre, une puce a ADN ou un filtre peut etre un outil extremement 
interessant pour la determination, la detection et/ou V identification d'un 
microorganisme. Ainsi, on prefere egalement les puces a ADN selon T invention qui 
contiennent en outre au moins une sequence nucleotidique d'un microorganisme 
5 autre de Listeria monocytogenes, immobilize sur le support de ladite puce. De 
preference, le microorganisme choisi I'est parmi les bacteries du genre Listeria (ci- 
apres designees comme bacteries associees a L. monocytogenes), ou les variants de 
Listeria monocytogenes EGD-e. 

Une puce a ADN ou un filtre selon l'invention est un element trbs utile de 

10 certains kits ou n^cessaires pour la detection et/ou 1' identification de 
microorganismes, en particulier les bacteries appartenant a l'espece Listeria 
monocytogenes ou les microorganismes associes, egalement objets de Pinvention. 

Par ailleurs, les puces a ADN ou les filtres selon l'invention, contenant des 
sondes ou amorces specifiques de Listeria monocytogenes, sont des elements tres 

15 avantageux de kits ou necessaires pour la detection et/ou la quantification de 
T expression de genes de Listeria monocytogenes (ou de microorganismes associes). 

En efFet, le controle de Texpression des g6nes est un point critique pour 
optimiser la croissance et le rendement d f une souche, soit en permettant l'expression 
d'un ou plusieurs genes nouveaux, soit en modifiant l'expression de genes dejk 

20 presents dans la cellule. La presente invention fournit l'ensemble des sequences 
naturellement actives chez L. monocytogenes permettant l'expression des genes. 
Elle permet ainsi la determination de l'ensemble des sequences exprimees chez L 
monocytogenes. Elle fournit egalement un outil permettant de reperer les genes dont 
l'expression suit un schema donne. Pour realiser cela, TADN de tout ou partie des 

25 g^nes de L. monocytogenes peut etre amplifie grace a des amorces selon Tinvention, 
puis fixe a un support comme par exemple le verre ou le nylon ou une puce a ADN, 
afin de construire un outil permettant de suivre le profil d'e?q)ression de ces genes. 
Cet outil, constitue de ce support contenant les sequences codantes sert de matrice 
d'hybridation a un melange de molecules marquees refletant les ARN messagers 

30 exprimes dans la cellule (en particulier les sondes marquees selon l'invention). En 
rep&ant cette experience a differents instants et en combinant l'ensemble de ces 
donnSes par un traitement approprie, on obtient alors les profils d'expression de 
Tensemble de ces genes. La connaissance des sequences qui suivent un schema de 
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regulation donnee peut aussi etre mise a profit pour rechercher de maniere dirigee, 
par exemple par homologie, d'autres sequences suivant globalement, mais de 
mantere legSrement differente le meme schema de regulation. En complement, il est 
possible d f isoler chaque sequence de controle presente en amont des segments 
5 servant de sondes et d'en suivre l'activit£ a l'aide de moyen approprie comme un 
gene rapporteur (luciferase, P-galactosidase, GFP pour « Green Fluorescent 
Protein »). Ces sequences isolees peuvent ensuite etre modifiees et assemblies par 
ingenierie metabolique avec des sequences d'int6ret en vue de leur expression 
optimale. 

10 Grace k la sequence genomique presentee dans la presente invention, 

Phomme du metier saura identifier les genes codant pour des proteines regulant la 
transcription des genes de L monocytogenes. Par ailleurs, le tableau I fournit la liste 
des phases ouvertes de lecture (ORF pour « Open Reading Frame) identifies sur le 
genome de Listeria monocytogenes (SEQ ID N° 1), avec leur position sur ledit 

15 genome, et les fonctions putatives qui peuvent leur etre attributes. Toutefois, 
une telle liste ne doit pas etre consideree comme limitative, une proteine pouvant 
faire avoir plusieurs roles dans la cellule. 

Modifier la structure ou Tintegrite de ces genes pourra permettre de modifier 
Texpression des genes cibles controles par des promoteurs cibles de ces regulateurs. 

20 Ainsi, Thomme du metier pourra choisir le ou les regulateurs pertinents pour 
Fapplication recherchee ainsi que leur cible, ce qui permet Poptimisation de 
Texpression de genes d'interet. ^utilisation des outils precedemment decrits tels les 
puces k ADN, permet aussi de reperer l'ensemble des genes dont la regulation est 
modifiee par Tinactivation de certains genes. II est ainsi possible de s&ectionner un 

25 ensemble de sequence de contrdle repondant, k des nuances pr6s, a un meme type 
de regulation. Ces sequences peuvent etre alors utilisees pour controler Texpression 
de g^nes d'interet. 

L' invention concerne egalement les polypeptides codes par une sequence 
nucteotidique selon Tinvention, de preference, par un fragment representatif de la 
30 sequence SEQ ID N° 1 et correspondant a une sequence ORF, telle que decrite dans 
le tableau L En particulier, les polypeptides de Listeria monocytogenes caracterises 
en ce qu'ils sont choisis parmi les polypeptides de sequence SEQ ID N° 2 a SEQ ID 
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N° 2854, de preference de sequence SEQ ID N° 2 a SEQ ID N° 41 et SEQ ED N° 42 
a SEQ ID N° 64 sont objet de l'invention. 

L'invention comprend egalement les polypeptides caracterises en ce qu'ils 
comprennent un polypeptide choisi parmi : 
5 a) un polypeptide selon l'invention ; 

b) un polypeptide presentant au moins 80 % de preference 85 %, 90 %, 95 
% et 98 % d'identite avec un polypeptide selon l'invention ; 

c) un fragment d'au moins 5 acides amines d'un polypeptide selon 
l'invention, ou tel que defini en b) ; 

0 d) un fragment biologiquement actif d'un polypeptide selon l'invention, ou 

tel que defini en b) ou c) ; et 
e) un polypeptide modifie d'un polypeptide selon l'invention, ou tel que 

defini en b), c) ou d). 
Les sequences nucleotidiques codant pour les polypeptides decrits 
5 precedemment sont egalement objet de l'invention. 

Dans la presente description, les termes polypeptides, sequences 
polypeptidiques, peptides et proteines sont interchangeables. 

II doit etre compris que l'invention ne concerne pas les polypeptides sous 
forme natureile, c'est-a-dire qu'ils ne sont pas pris dans leur environnement naturel 
mais qu'ils ont pu 6tre isoles ou obtenus par purification a partir de sources 
natureiles, ou bien obtenus par recombinaison genetique, ou par synthese chimique, 
et qu'ils peuvent alors comporter des acides amines non naturels comme cela sera 
decrit plus loin. 

Par polypeptide presentant un certain pourcentage d'identite avec un autre, 
que Ton designera egalement par polypeptide homologue, on entend designer les 
polypeptides presentant par rapport aux polypeptides naturels, certaines 
modifications, en particulier une deletion, addition ou substitution d'au moins un 
acide amine, une troncation, un allongement, une solution chimerique et/ou une 
mutation, ou les polypeptides presentant des modifications post-traductionnelles. 
Parmi les polypeptides homologues, on prefere ceux dont la sequence d' acides 
amines presentent au moins 80 %, de preference 85 %, 90 %, 95 % et 98 % 
d'homologie avec les sequences d'acides amines des polypeptides selon l'invention. 
Dans le cas d'une substitution, un ou plusieurs acide(s) amine(s) cons£cutifl[s) ou 
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non consecutif(s) soat remplaces par des acides amines « equivalents ». 
L'expression « acides amines equivalents » vise ici a designer tout acide amine 
susceptible d'etre substitue a Tun des acides amines de la structure de base sans 
cependant modifier essentiellement les activites biologiques des peptides 
5 correspondant et telles qu'elles seront definies par la suite. 

Ces acides amines equivalents peuvent etre determines soit en s'appuyant 
sur leur homologie de structure avec les acides amines auxquels ils se substituent, 
soit sur des resultats d'essais comparatifs d'activite biologique entre les differents 
polypeptides susceptibles d'etre effectu£s. 

10 A titre d'exemple, on mentionne les possibilites de substitution susceptibles 

d'etre effectuees sans qu'il resulte en une modification approfondie de l'activite 
biologique du polypeptide modifie correspondant. On peut remplacer ainsi la 
leucine par la valine ou l'isoleucine, 1'acide aspartique par l'acide glutamine, la 
glutamine par Tasparagine, l'arginine par la lysine, etc... les substitutions inverses 

1 5 etant naturellement envisageables dans les memes conditions. 

Les polypeptides homologues correspondent egalement aux polypeptides 
codes par les sequences nucleotidiques homologues ou identiques, telles que 
definies precedemment et comprennent ainsi dans la pr&ente definition des 
polypeptides mutes ou correspondant a des variations inter ou intra especes, 

20 pouvant exister chez Listeria, et qui correspondent notamment a des troncatures, 
substitutions, deletions et/ou additions, d'au moins un residu d'acides amines. 

II est entendu que 1'on calcule le pourcentage d'identite entre deux 
polypeptides de la meme fa9on qu'entre deux sequences d'acides nucleiques. Ainsi, 
le pourcentage d'identite entre deux polypeptides est calcule apr6s alignement 

25 optimal de ces deux sequences, sur une fenetre d'homologie maximale. Pour definir 
ladite fenetre d'homologie maximale, on peut utiliser les memes algorithmes que 
pour les sequences d' acide nucleique. 

Par fragment biologiquement actif d'un polypeptide selon 1' invention, on 
entend designer en particulier un fragment de polypeptide, tel que defmi ci-apr6s, 

30 presentant au moins une des caracteristiques biologiques des polypeptides selon 
l'invention, notamment en ce qu'il est capable d'exercer de maniere generate une 
activite meme partielle, tel que par exemple : 
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- une activite enzymatique (metabolique) ou une activite pouvant etre 
impliquee dans la biosynthese ou la biodegradation de composes 
organiques ou inorganiques ; 

• une activite structured (enveloppe cellulaire, molecule chaperonne, 
5 ribosome) ; 

- une activite de transport (d'energie, d'ion) ; ou dans la secretion de 
proteine ; 

- une activite dans le processus de replication, amplification, preparation, 
transcription, traduction ou maturation, notamment de TADN, de FARN 

10 ou des prolines. 

Par fragment de polypeptides selon ['invention, on entend designer un 
polypeptide comportant au minimum 5 acides amines, de preference 10, 15, 25, 50, 
100 et 150 acides amines. 

Les fragments de polypeptides peuvent correspondre a des fragments isoles 

15 ou purifies naturellement presents dans les souches de Listeria, ou a des fragments 
qui peuvent etre obtenus par clivage dudit polypeptide par une enzyme proteolitique 
telle que la trypsine ou la chymotrypsine ou la collagenase, par un reactif chimique 
(bromure de cyanogene, CNBr) ou en pla$ant ledit polypeptide dans un 
environnement tres acide (par exemple a pH = 2,5). Des fragments polypeptidiques 

20 peuvent egalement etre prepares par synthese chimique, k partir d'hotes transformes 
par un vecteur d'expression selon Pinvention qui contiennent un acide nucleique 
pennettant Fexpression dudit fragment, et place sous le controle des elements de 
regulation et/ou d'expression appropries. 

Par « polypeptide modifie » d'un polypeptide selon Tinvention, on entend 

25 designer un polypeptide obtenu par recombinaison genetique ou par synthese 
chimique comme decrit plus loin, qui presente au moins une modification par 
rapport a la sequence normale, de preference au plus 10 % d'acides amines modifies 
par rapport a la sequence normale. Ces modifications peuvent etre notamment 
portees sur des acides amines necessaires pour la sp6cificite ou Pefficacite de 

30 T activite, ou a Forigine de la conformation structural, de la charge, ou de 
Thydrophobicite du polypeptide selon Tinvention. On peut ainsi creer des 
polypeptides d'activite equivalente, augmentee ou diminuee, ou de specificite 
equivalents plus etroite ou plus large. Parmi les polypeptides modifies, il faut citer 
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les polypeptides dans lesquels jusqu'a cinq acides amines peuvent etre modifies, 
tronques & Textremite N ou C-terminale, ou bien deletes, ou ajoutes. 

Comme cela est indique, les modifications d'un polypeptide ont pour 
objectif notamment : 

5 -de permettre sa mise en ceuvre dans des procedes de biosynthese ou de 

biodegradation de composes organiques ou inorganiques, 
de permettre sa mise en oeuvre dans des procedes de replication, 
d'amplification, de reparation et regie de transcription, de traduction, ou 
de maturation notamment de 1' ADN, TARN, ou de proteines, 
10 -de permettre sa secretion amelioree, 

- de modifier sa solubiiite, l'efficacite ou la specificite de son activate, ou 

encore de faciliter sa purification. 
La synthese chimique presente egalement Pavantage de pouvoir utiliser des 
acides amines non naturels ou des liaisons non peptidiques. Ainsi, il peut etre 
1 5 interessant d'utiliser des acides amines non naturels, par exemple sous forme D, ou 
des analogues d'acides amines, notamment des formes souffrees. 

La presente invention fournit la sequence nucleotidique du genome de 
Listeria monocytogenes EGD-e, ainsi que certaines sequences polypeptidiques. 
L'homme du metier pourra determiner les autres ORFs, en utilisant des methodes 
20 connues, et des logiciels appropries. 

Parmi les genes identifies dans la sequence genomique de L. 
monocytogenes, on peut citer en particulier les genes impliques dans la biosynth&se 
de la vitamine B12 (SEQ ID N° 42 a SEQ ED N° 64). Cette bacterie est ainsi 
capable de synth&iser naturellement cette vitamine, et la connaissance des genes 
25 menant a leur synthese permet a Thomme du metier d'optimiser l'expression de ces 
genes ou de les modifier en vue d'augmenter la production de cette vitamine. Ainsi, 
la presente invention a egalement pour objet un proced6 de production de la 
vitamine B12, caracterise en ce que Ton fournit le substrat de depart a une cellule 
hote contenant les genes correspondant a SEQ ID N° 42 a SEQ ID N° 64, que l'on 
30 la cultive dans les conditions approprtees pour la production de vitamine B12, et 
que l'on recupere ladite vitamine. De preference, la cellule hote est une cellule 
bacterienne, de fa?on plus preferee, une bacterie du genre Bacillus ou Listeria. Un 
procede de production de la vitamine B12 utilisant une sequence nucleique, ou 
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polypeptidique selon Finvention, ou une cellule hote selon Finvention ou un animal 
ou un vegetal selon F invention est egalement un objet de la presente invention. 

De maniere generate, la liste des sequences SEQ ED, ou leur sequence 
f nucleique codante correspondante, pourra etre determinee par Fhonune de Fart a 
5 parti des fonctions putatives les plus probables d&erminees pour chacune des 
sequences SEQ ID dans le tableau 1 ci-apr6s pour chacune des classes d'activite 
repertorie ct-apres. 

Ainsi et de maniere preferee, Finvention est relative i une sequence 

nucleotidique selon Finvention caracterisees en ce qu'elle code pour un polypeptide 
10 de Listeria monocytogenes ou un de ses fragments implique dans la biosynthese de 

la vitamine B12. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 

42 k SEQ ID N° 64. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 

selon l'invention caracterisees en ce qu'elle code pour un polypeptide d'enveloppe 
1 5 cellulaire ou present a la surface de Listeria monocytogenes ou un de ses fragments. 

De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 2 a SEQ ED N° 

41. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention, caracterisees en ce qu'elle code pour un polypeptide de Listeria 
20 monocytogenes ou un de ses fragments implique dans la biosynthese des acides 
amines. 

De maniere preferee, l'invention est relative a une sequence nucleotidique 
selon l'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans la biosynthese des cofacteurs, 
25 groupes prosthetiques et transporters. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans la machinerie cellulaire. 

De maniere preferee, Finvention est relative a une sequence nucleotidique 
30 selon Finvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme intermediate 
central. 
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De maniere preferee, ('invention est relative a une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme energetique. 
De maniere preferee, Pinvention est relative a une sequence nucleotidique 
5 selon l'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le metabolisme des acides 
gras et des phospholipides. 

De maniere preferee, F invention est relative & une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
10 monocytogenes ou un de ses fragments implique dans le metabolisme des 
nucleotides, des purines, des pyrimidines ou nucleosides. 

De maniere preferee, Pinvention est relative a une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans les fonctions de regulation. 
15 De maniere preferee, Pinvention est relative a une sequence nucleotidique 

selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de replication. 

De maniere preferee, Pinvention est relative a une sequence nucleotidique 
selon l'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
20 monocytogenes ou un de ses fragments implique dans le processus de transcription. 

De maniere preferee, Pinvention est relative h une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de traduction. 

De maniere preferee, l'invention est relative k une sequence nucleotidique 
25 selon l'invention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
monocytogenes ou un de ses fragments implique dans le processus de transport et de 
liaison des proteines. 

De maniere preferee, Pinvention est relative a une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
30 monocytogenes ou un de ses fragments implique dans Padaptation aux conditions 
atypiques. 

De maniere preferee, l'invention est relative a une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide de Listeria 
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monocytogenes ou un de ses fragments dans la sensibilite aux medicaments et 
analogues. 

De maniere preferee, T invention est relative a une sequence nucleotidique 
selon Pinvention caract&isees en ce qu'elle code pour un polypeptide de Listeria 
5 monocytogenes ou un de ses fragments implique dans les fonctions relatives aux 
transposons. 

De maniere preferee, Pinvention est relative a une sequence nucleotidique 
selon Pinvention caracterisees en ce qu'elle code pour un polypeptide specifique de 
Listeria monocytogenes ou un de ses fragments. 
10 Sous un autre aspect, de mantere preferee, Pinvention a pour objet un 

polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese de la 
vitamine B12. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 42 
a SEQ ID N° 64. 

15 Sous un autre aspect, de manure preferee, Pinvention a pour objet un 

polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide 
d'enveloppe cellulaire ou de surface de Listeria monocytogenes ou un de ses 
fragments. De preference, il s'agit d'un polypeptide de sequence SEQ ID N° 2 a 
SEQ ID N° 41. 

20 Sous un autre aspect, de maniere preferee, Pinvention a pour objet un 

polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese des 
acides amines. 

Sous un autre aspect, de manure preferee, Pinvention a pour objet un 
25 polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans la biosynthese des 
cofacteurs, groupes prosthetiques et transporters. 

Sous un autre aspect, de maniere preferee, Pinvention a pour objet un 
polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
30 Listeria monocytogenes ou un de ses fragments implique dans la machinerie 
cellulaire. 

Sous un autre aspect, de mantere preferee, Pinvention a pour objet un 
polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
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Listeria monocytogenes ou un de ses fragments implique dans le metabolisme 
intermediate central. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
5 Listeria monocytogenes ou un de ses fragments implique dans le metabolisme 
energetique. 

Sous un autre aspect, de maniere preferee, Pinvention a pour objet un 
polypeptide selon Pinvention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le metabolisme des 
1 0 acides gras et des phospholipides. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le metabolisme des 
nucleotides, des purines, des pyrimidines ou nucleosides. 
15 Sous un autre aspect, de mantefe preferee, l'invention a pour objet un 

polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans les fonctions de 
regulation. 

Sous un autre aspect, de mantere preferee, l'invention a pour objet un 
20 polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans le processus de 
replication. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
25 Listeria monocytogenes ou un de ses fragments implique dans le processus de 
transcription. 

Sous un autre aspect, de mantere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments impliqu6 dans le processus de 
30 traduction. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
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Listeria monocytogenes ou un de ses fragments implique dans le processus de 
transport et de liaison des proteines. 

Sous un autre aspect, de maniere preferee, I' invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
5 Listeria monocytogenes ou un de ses fragments implique dans Padaptation aux 
conditions atypiques. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments dans la sensibilite aux medicaments 
10 et analogues. 

Sous un autre aspect, de maniere preferee, l'invention a pour objet un 
polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide de 
Listeria monocytogenes ou un de ses fragments implique dans les fonctions 
relatives aux transposons. 

15 Sous un autre aspect, de mantere preferee, l'invention a pour objet un 

polypeptide selon l'invention, caracterise en ce qu'il s'agit d'un polypeptide 
specifique de Listeria monocytogenes ou un de ses fragments. 

II est important de noter toutefois qu'un organisme vivant est un tout et doit 
etre pris comme tel. Ainsi, afin de pouvoir se developper et d'exhiber ses proprietes, 

20 tout organisme a besoin d'interactions entre les difftrentes voies metaboliques. 
Ainsi, la classification enoncee ci-dessus ne doit pas etre consideree comme 
limitative, un gene pouvant Stre impliqud dans deux voies metaboliques distinctes. 

La presente invention a egalement pour objet les sequences nucleotidiques 
et/ou de polypeptides selon l'invention, caracterisees en ce que lesdites sequences 

25 sont enregistrees sur un support d'enregistrement dont la forme et la nature 
facilitent la lecture, Panalyse et/ou I'exploitation de ladite ou desdites sequence(s). 
Ces supports peuvent egalement contenir d'autres informations extraites de la 
presente invention, notamment les analogies avec des sequences deji connues, et/ou 
des informations concernant les sequences nucleotidiques et/ou de polypeptides 

30 d'autres microorganismes afin de faciliter Panalyse comparative et I'exploitation 
des resultats obtenus. 

Parmi cesdits supports d'enregistrement, on prefere en particulier les 
supports lisibles par un ordinateur, tels les supports magnetiques, optiques, 
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electriques ou hybrides, en particulier les disquettes informatiques, les CD-ROM, 
les serveurs informatiques. De tels supports d'enregistrement sont egalement objet 
de P invention. 

Les supports d'enregistrement selon Pinvention, avec les informations 
5 apportees, sont tres utiles pour le choix d'amorces ou de sondes nucleotidiques pour 
la determination de gdnes dans Listeria monocytogenes ou souches proches de cet 
organisme. De meme, Putilisation de ces supports pour P etude du polymorphisme 
genetique de souche proche de Listeria monocytogenes, en particulier par la 
determination des regions de colinearite, est tres utile dans la mesure ou ces 

10 supports fournissent non seulement la sequence nucleotidique du genome de 
Listeria monocytogenes egb, mais egalement P organisation genomique dans ladite 
sequence. Ainsi, les utilisations de supports d'enregistrement selon Pinvention sont 
egalement des objets de Pinvention. 

L'analyse d'homologie entre differentes sequences s'effectue en effet 

1 5 avantageusement a Paide de logiciels de comparaisons de sequences, tels le logiciel 
Blast, ou les logiciels de la trousse GCG, decrits precedemment, 

L' invention vise egalement les vecteurs de clonage et/ou d'expression, qui 
contiennent une sequence nucleotidique selon Pinvention. On prefere en particulier, 
les sequences nucleotidiques codant pour des polypeptides d'enveloppe cellulaire 

20 ou de surface, ou impliques dans la machinerie cellulaire, en particulier la secretion, 
le metabolisme intermediate central, en particulier la production de sucre, le 
metabolisme energetique, les processus de synthase de la vitatnine B12, de 
transcription et de traduction, de synthese de polypeptides. 

Les vecteurs selon Pinvention comportent de preference des elements qui 

25 permettent Pexpression et/ou la secretion des sequences nucleotidiques dans une 
cellule hote determin6e. 

Le vecteur doit alors comporter un promoteur, des signaux d'initiation et de 
terminaison de la traduction, ainsi que des regions appropriees de regulation de la 
transcription, n doit pouvoir etre maintenu de fa?on stable dans la cellule hote et 

30 peut eventuellement posseder des signaux particuliers qui specifient la secretion de 
la proteine traduite. Ces differents Elements sont choisis et optimises par Phomme 
du metier en fonction de Phote cellulaire utilise. A cet effet, les sequences 
nucleotidiques selon Pinvention peuvent etre ins6rees dans des vecteurs a 
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replication autonome au sein de I'hote choisi, ou etre des vecteurs integratifs de 
I'hote choisi. 

De teis vecteurs sont prepares par des methodes couramment utilisees par 
1'homme du metier, et les clones resultant peuvent &re introduits dans un hote 
5 approprte par des methodes standards, telle que la iipofection, l'electroporation, le 
choc thermique, ou des methodes chimiques. 

Les vecteurs selon Pinvention sont par exemple des vecteurs d'origine 
plasmidique ou virale. lis sont utiles pour transformer des cellules hotes afin de 
doner ou d'exprimer les sequences nucleotidiques selon l'invention. 

10 L'invention comprend egalement les cellules hotes transformees par un 

vecteur selon 1'invention. 

L'hote cellulaire peut etre choisi parmi des systemes procaryotes ou 
eucaryotes, par exemple les cellules bacteriennes mais egalement les cellules de 
levure ou les cellules animales, en particulier les cellules de mammiferes. On peut 

15 egalement utiliser des cellules d'insectes ou des cellules de plantes. Les cellules 
hotes preferees selon l'invention sont en particulier les cellules procaryotes, de 
preference les bacteries appartenant au genre Listeria, k l'esp^ce Listeria 
monocytogenes, ou les microorganismes associes a l'espece Listeria 
monocytogenes, L'invention concerne egalement les animaux, excepte 1'homme, 

20 qui comprennent une cellule transformee selon l'invention. Les cellules 
transformees selon l'invention sont utilisables dans des procedes de preparation de 
polypeptides recombinants selon l'invention. Les procedes de preparation d'un 
polypeptide selon l'invention sous forme recombinante, caracterises en ce qu'ils 
mettent en oeuvre un vecteur et/ou une cellule transformee par un vecteur selon 

25 l'invention sont eux-memes compris dans la presente invention. De preference, on 
cultive une cellule transformee par un vecteur selon l'invention dans des conditions 
qui permettent 1'expression dudit polypeptide et on recupere ledit peptide 
recombinant. Les cellules h6tes selon l'invention peuvent egalement etre utilisees 
pour la preparation de compositions alimentaires, qui sont elles-memes objet de la 

30 presente invention. 

Ainsi qu'il a ete dit, I'hote cellulaire peut etre choisi parmi des systemes 
procaryotes ou eucaryotes. En particulier, il est possible d' identifier des sequences 
nucleotidiques selon l'invention, facilitant la secretion dans un tel syst&ne 
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procaryote ou eucaiyote. Un vecteur selon T invention portant une telle sequence 
peut done etre avantageusement utilise pour la production de prolines 
recombinantes, destinees a etre secretees. En effet, la purification de ces proteines 
recombinantes d'intergt sera facilite par le fait qu'elles sont pr6sentent dans le 
5 surnageant de la culture cellulaire plutot qu'a 1'interieur des cellules hotes. 

On peut egalement preparer les polypeptides selon P invention par synthese 
chimique. Un tel procede de preparation est egalement un objet de l'invention. 
L'homme du metier connalt les precedes de synthese chimique, par exemple les 
techniques mettant en oeuvre des phases solides (voir notamment Steward et al., 

1 0 1984, Solid phase peptides synthesis, Pierce Chem. Company, Rockford, 1 1 1, 2eme 
ed., (1984)) ou des techniques utilisant des phases solides partielles, par 
condensation de fragments ou par une synthese en solution classique. Les 
polypeptides obtenus par synthese chimique et pouvant comporter des acides 
amines non naturels correspondant sont egalement compris dans l'invention. 

15 L'invention est en outre relative k des polypeptides hybrides presentant au 

moins un polypeptide ou un de ses fragments selon l'invention, et une sequence 
d'un polypeptide susceptible d'induire une reponse immunitaire chez l'homme ou 
P animal 

Avantageusement, le determinant antigenique est tel qu'il est susceptible 
20 d'induire une reponse humorale et/ou cellulaire. 

Un tel determinant pourra comprendre un polypeptide ou un de ses 
fragments selon l'invention sous forme glycosylee utilise en vue d'obtenir des 
compositions immunogenes susceptibles d'induire la synthase d'anticorps diriges 
contre des epitopes multiples. Lesdits polypeptides ou leurs fragments glycosyles 
25 font egalement partie de l'invention. 

Ces molecules hybrides peuvent etre constitutes en partie d'une molecule 
porteuse de polypeptides ou de leurs fragments selon l'invention, associee a une 
partie eventuellement immunogene, en particulier un Epitope de la toxine 
diphterique, la toxine tetanique, un antigene de surfece du virus de Phepatite B 
30 (brevet FR 79 21811), Pantigene VP1 du virus de la poliomyelite ou toute autre 
toxine ou antigene viral ou bacterien. 

Les procedes de synthase des molecules hybrides englobent les methodes 
utilisdes en g6nie g6n&ique pour construire des sequences nucleotidiques hybrides 
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codant pour les sequences polypeptidiques recherchees. On pourra, par exemple, se 
referer avantageusement a la technique d'obtention de genes codant pour des 
protemes de fusion decrite par Minton en 1984. 

Lesdites sequences nucleotidiques hybrides codant pour un polypeptide 
5 hybride ainsi que les polypeptides hybrides selon 1* invention caracterises en ce qu'il 
s'agit de polypeptides recombinants obtenus par l'expression desdites sequences 
nucleotidiques hybrides, font egalement partie de l'invention. 

L'invention comprend egalement les vecteurs caracterises en ce qu'ils 
contiennent une desdites sequences nucleotidiques hybrides. Les cellules hotes 
10 transformees par lesdits vecteurs, les animaux transgeniques comprenant une 
desdites cellules transformees ainsi que les procedes de preparation de polypeptides 
recombinants utilisant lesdits vecteurs, lesdites cellules transformees et/ou lesdits 
animaux transgeniques font bien entendu egalement partie de l'invention. 

Le couplage entre un polypeptide selon l'invention et un polypeptide 
15 immunogene, peut etre effectue par voie chimique, ou par voie biologique. Ainsi, 
selon l'invention, il est possible d'introduire un ou plusieurs element(s) de liaison, 
notamment des acides amine* pour faciliter les reactions de couplage entre le 
polypeptide selon l'invention, et le polypeptide immunostimulateur, le couplage 
covalent de l'antigene immunostimulateur pouvant etre realist a I'extr^mte N ou 
20 C-terminale du polypeptide selon l'invention. Les reactifs bifonctionnels permettant 
ce couplage sont determines en fonction de l'extremite choisie pour realiser ce 
couplage, et les techniques de couplage sont bien connues de l'homme du metier. 

Les conjugues issus d'un couplage de peptides peuvent etre egalement 
prepares par recombinaison genetique. Le peptide hybride (conjugue) peut en effet 
25 etre produit par des techniques d'ADN recombinant, par insertion ou addition a la 
sequence d'ADN codant pour le polypeptide selon l'invention, d'une sequence 
codant pour le ou les peptide(s) antigene(s), immunogene(s) ou haptene(s). Ces 
techniques de preparation de peptides hybrides par recombinaison genetique sont 
bien connues de I'homme du metier (voir par exemple Makrides, 1996, 
30 Microbiological Reviews 60,512-538). 

De preference, ledit polypeptide immunitaire est choisi dans le groupe des 
peptides contenant les anatoxines, notamment le toxoi'de diphterique ou le toxoide 
tetanique, les proteines derivees du Streptocoque (comme la proteine de liaison a la 
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seralbumine humaine), les proteines membranaires OMPA et les complexes de 
proteines de membranes extemes, les vesicules de membranes externes ou les 
proteines de chocs thermiques. 

Les polypeptides hybrides selon 1' invention sont tres utiles pour obtenir des 
5 anticorps monoclonaux ou polyclonaux, capables de reconnaitre specifiquement les 
polypeptides selon l'invention. En effet, un polypeptide hybride selon l'invention 
permet la potentiation de la reponse immunitaire, contre le polypeptide selon 
l'invention couple a la molecule immunogene. De tels anticorps monoclonaux ou 
polyclonaux, leurs fragments, ou les anticorps chimeriques, reconnaissant les 
1 0 polypeptides selon l'invention, sont egalement objets de l'invention. 

Les anticorps monoclonaux specifiques peuvent Stre obtenus selon la 
methode classique de culture d'hybridome decrite par Kohler et Nfilstein (1975, 
Nature 256, 495). 

Les anticorps selon l'invention sont par exemple des anticorps chimeriques, 
15 des anticorps humanises, des fragments Fab, ou F(ab') 2 II peut egalement se 
presenter sous forme d'immunoconjugue ou d'anticorps marque afin d'obtenir un 
signal detectable et/ou quantifiable. 

Ainsi, les anticorps selon l'invention peuvent Stre employes dans un precede 
pour la detection et/ou l'identification de bacteries appartenant a 1'espece Listeria 
20 monocytogenes ou a un microorganisme associ6 dans un echantillon biologique, 
caracterise en ce qu'il comprend les etapes suivantes : 

a) mise en contact de l'echantillon biologique avec un anticorps selon 
l'invention ; 

b) mise en evidence du complexe antigene-anticorps eventuellement forme. 
25 Les anticorps selon la presente invention sont egalement utilisables afin de 

detecter une expression d'un gene de Listeria monocytogenes ou de 
microorganismes associes. En effet, la presence du produit d'expression d'un gene 
reconnu par un anticorps specifique dudit produit expression peut etre detectee par 
la presence d'un complexe antigene-anticorps forme apres la mise en contact de la 
30 souche de Listeria monocytogenes ou du microorganisme associe avec un anticorps 
selon l'invention. La souche bacterienne utilisee peut avoir ete « preparee », c'est-a- 
dire centrifugee, lysee, placee dans un reactif approprie pour la constitution du 
milieu propice a la reaction immunologique. En particulier, on prefere un procede 
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de detection de P expression dans le gene, correspondant a un Western blot, pouvant 
etre effectue apres une electrophorese sur gel de polyacrylamide d'un lysat de la 
souche bacterienne, en presence ou en Pabsence de conditions reductrices (SDS- 
PAGE). Apres migration et separation des proteines sur le gel de polyacrylamide, 
5 on transfere lesdites proteines sur une membrane appropriee (par exemple en nylon) 
et on detecte la presence de la proteine ou du polypeptide d'interet, par mise en 
contact de ladite membrane avec un anticorps selon Pinvention. 

Ainsi, la presente invention comprend egalement les kits ou necessaires pour 
la mise en oeuvre d'un procede tel que decrit (de detection de Pexpression d'un gene 
10 de Listeria monocytogenes ou d'un microorganisme associe, ou pour la detection 
et/ou Tidentification de bact&ies appartenant a Pespece Listeria monocytogenes ou 
un microorganisme associe), comprenant les elements suivants : 

a) un anticorps polyclonal ou monoclonal selon P invention ; 

b) eventuellement, les reactifs pour la constitution du milieu propice a la 
1 5 reaction immunologique ; 

c) eventuellement, les reactifs permettant la mise en evidence des 
complexes antiggne-anticorps produits par la reaction immunologique. 

Les polypeptides et les anticorps selon Pinvention peuvent avantageusement 
etre immobilises sur un support, notamment une puce a proteines. Une telle puce a 

20 proteines est un objet de Pinvention, et peut egalement contenir au moins un 
polypeptide d'un microorganisme autre que Listeria monocytogenes ou un anticorps 
dirige contre un compose d'un microorganisme autre que Listeria monocytogenes. 

Les puces a proteines ou filtres a haute density contenant des proteines selon 
Pinvention peuvent etre construits de la meme maniere que les puces a ADN selon 

25 Pinvention. En pratique, on peut effectuer la synthese des polypeptides fixes 
directement sur la puce a proteines, ou effectuer une synthese ex situ suivie d'une 
etape de fixation du polypeptide synthetise sur ladite puce. Cette derntere methode 
est preferable, lorsque Pon desire fixer des proteines de taille importante sur le 
support, qui sont avantageusement pr6parees par genie genetique. Toutefois, si Pon 

30 ne desire fixer que des peptides sur le support de ladite puce, il peut etre plus 
interessant de proceder a la synthese desdits peptides directement in situ. 

Les puces a proteines selon Pinvention peuvent etre avantageusement 
utilisees dans des kits ou necessaires pour la detection et/ou Identification de 
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bacteries associees a Tespece Listeria monocytogenes ou a un microorganisme, ou 
de fa9on plus generate dans des kits ou necessaires pour la detection et/ou 
Tidentification de microorganismes. Lorsque Ton fixe les polypeptides selon 
Tinvention sur les puces a ADN, on recherche la presence d'anticorps dans les 
5 echantillons testes, la fixation d'un anticorps selon Tinvention sur le support de la 
puce a proteines permettant identification de la proteine dont ledit anticorps est 
sp£cifique. 

De preference, on fixe un anticorps selon Tinvention sur le support de la 
puce a proteines, et on detecte la presence de Tantigene correspondant, specifique 
10 de Listeria monocytogenes ou d'un microorganisme associe. 

Une puce a proteines ci-dessus decrite peut etre utilisee pour la detection de 
produits de genes, pour etablir un profil d'expression desdits genes, en complement 
d'une puce a ADN selon Tinvention. 

Les puces a proteines selon Tinvention sont egalement extremement utiles 
1 5 pour les experiences de proteomique, qui etudie les interactions entre les differentes 
prolines d'un microorganisme donne. De fafon simplifiee, on fixe des peptides 
representatifs des differentes proteines d'un organisme sur un support. Puis, on met 
ledit support en contact avec des proteines marquees, et apres une etape optionnelle 
de rinyage, on detecte des interactions entre lesdites proteines marquees et les 
20 peptides fixes sur la puce a proteines. 

Ainsi, les puces a proteines comprenant une sequence polypeptidique selon 
Tinvention ou un anticorps selon Tinvention sont objet de Tinvention, ainsi que les 
kits ou necessaires les contenant. 

La presente invention couvre egalement un procede de detection et/ou 
25 d'identification de bacteries appartenant a TespSce Listeria monocytogenes ou k un 
microorganisme assocte dans un echantillon biologique, qui met en oeuvre une 
sequence nucleotidique selon Tinvention. 

II doit etre entendu que le terme echantillon biologique concerne dans la 
presente invention les echantillons preleves k partir d'un organisme vivant (en 
30 particulier sang, tissus, organes ou autres preleves a partir d'un mammifere) ou un 
echantillon contenant du materiel biologique, c'est-a-dire de TADN. Un tei 
echantillon biologique englobe done les compositions alimentaires contenant des 
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bacteries (par exemple les fromages, les produits laitiers), mais egalement des 
compositions alimentaires contenant des levures (bieres, pains) ou autres. 

Le procede de detection et/ou d J identification mettant en ceuvre les 
sequences nucleotidiques selon l'invention peut etre de diverse nature. 

5 

On prefere un procede comportant les etapes suivantes : 

a) eventuellement, isolement de l'ADN a partir de l'echantillon biologique a 

analyser, ou obtention d'un ADNc a partir de TARN de l'echantillon 

biologique ; 

10 b) amplification specifique de l'ADN de bacteries appartenant a Tespece 

Listeria monocytogenes ou a un micro-organisme associe k Taide d'au 
moins une amorce selon 1' invention ; 
c) mise en evidence des produits d'amplification. 

Ce procede est base sur ramplification specifique de l'ADN, en particulier 
15 par une reaction d' amplification en chaTne. 

On prefere egalement un procede comprenant les etapes suivantes ; 

a) mise en contact d'une sonde nucleotidique selon l'invention avec un 
echantillon biologique, Tacide nucleique contenu dans l'echantillon 
biologique ayant, le cas echeant, prealablement ete rendu accessible a 

20 • Thybridation, dans des conditions permettant Thybridation de la sonde a 

l'acide nucleique d'une bacterie appartenant a Tespece Listeria 
monocytogenes ou a un micro-organisme associe ; 

b) mise en evidence de Thybride eventuellement forme entre la sonde 
nucleotidique et l'ADN de l'echantillon biologique. 

25 Un tel procede ne doit pas etre limite a la detection de la presence de l'ADN 

contenu dans l'echantillon biologique atteste, il peut etre egalement mis en oeuvre 
pour d&ecter l'ARN contenu dans ledit Echantillon. Ce procede englobe en 
particulier les Southern et Northern blot. 

Un autre procede prefer^ selon l'invention comprend les Etapes suivantes : 
*° a) mise en contact d'une sonde nucleotidique immobilisEe sur un support 

selon l'invention avec un echantillon biologique, l'acide nucleique de 
Techantillon, ayant, le cas echeant, ete prealablement rendu accessible a 
Thybridation, dans des conditions permettant Thybridation de la sonde a 
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Pacide nucleique d'une bacterie appartenant i l'espece Listeria 
monocytogenes ou a un micro-organisme associe ; 

b) mise en contact de Phybride forme entre la sonde nucleotidique 
immobilisee sur un support et Pacide nucleique contenu dans 

5 Pechantillon biologique, le cas echeant apr6s elimination de PADN de 

Pechantillon biologique n'ayant pas hybride avec la sonde, avec une 
sonde nucleotidique marquee selon Pinvention ; 

c) mise en evidence du nouvel hybride forme a Petape b). 

Ce procede est avantageusement utilise avec une puce i ADN selon 
10 Pinvention, Pacide nucleique recherche s'hybridant avec une sonde presente a la 
surface de ladite puce, et etant detecte par Putilisation d'une sonde marquee. Ce 
proc6d6 est avantageusement mis en oeuvre en combinant une etape prealable 
d'amplification de PADN ou de PADN complementaire obtenu eventuellement par 
transcription inverse, a Paide d'amorces selon Pinvention. 
1 5 Ainsi, la presente invention englobe egalement les kits ou necessaires pour 

la detection et/ou P identification de bacteries appartenant & Pespece Listeria 
monocytogenes ou a un micro-organisme associe, caracterise en ce qu'il comprend 
les elements suivants : 

a) une sonde nucleotidique selon Pinvention ; 
20 b) eventuellement, les reactifs necessaires a la mise en oeuvre d'une 

reaction d 5 hybridation ; 
c) eventuellement, au moins une amorce selon Pinvention ainsi que les 

reactifs necessaires k une reaction d'amplification de PADN. 
De meme, la presente invention englobe egalement les kits ou necessaires 
25 pour la detection et/ou P identification de bacteries appartenant a l'espece Listeria 
monocytogenes ou a un micro-organisme associe, caracterise en ce qu'il comprend 
les elements suivants : 

a) une sonde nucleotidique, dite sonde de capture, selon Pinvention; 

b) une sonde oligonucleotidique, dite sonde de revelation, selon 
30 Pinvention ; 

c) eventuellement, au moins une amorce selon Pinvention ainsi que les 
reactifs necessaires a une reaction d'amplification de PADN. 
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Enfin, les kits ou necessaires pour la detection et/ou r identification de 
bacteries appartenant a Tespece Listeria monocytogenes ou a un micro-organisme 
associe, caracterise en ce qu'il comprend les elements suivants : 
a) au moins une amorce selon Finvention ; 
5 b) eventuellement, les reactifs necessaires pour effectuer une reaction 

d'amplification d'ADN ; 
c) eventuellement, un composant permettant de verifier la sequence du 
fragment amplifie, plus particulierement une sonde oligonucleotidique 
selon Tinvention. 
1 0 sont egalement objets de la presente invention. 

De preference, lesdites amorces et/ou sondes et/ou polypeptides et/ou 
anticorps selon la presente invention utilises dans les procedes et/ou kits ou 
necessaires selon la presente invention sont choisis parmi les amorces et/ou sondes 
et/ou polypeptides et/ou anticorps specifiques de Tespece Listeria monocytogenes. 
15 De maniere preferee, ces elements sont choisis parmi les sequences nucleotidiques 
condant pour une proteine secretee, parmi les polypeptides secretes, ou parmi les 
anticorps diriges contre des polypeptides secretes de Listeria monocytogenes. 

La presente invention a egalement pour objet les souches de Listeria 
monocytogenes et/ou de microorganismes associes contenant une ou plusieurs 
20 mutation(s) dans une sequence nucleotidique selon Tinvention, en particulier une 
sequence ORF, ou leurs elements regulateurs (en particulier promoteurs). 

On prefere, selon la presente invention, les souches de Listeria 
monocytogenes pr&entant une ou plusieurs mutation(s) dans les sequences 
nucleotidiques codant pour des polypeptides impliques dans la machine cellulaire, 
en particulier la secretion, le metabolisme intermediaire central, le metabolisme 
energetique, les processus de synthese des acides amines, de transcription et de 
traduction, de synthese des polypeptides, 

Lesdites mutations peuvent mener a une inactivation du gene, ou en 
particulier lorsqu'elles sont situees dans les elements regulateurs dudit g&ie, a une 
surexpression de celui-ci. 

L'invention concerne en outre Tutilisation d'une sequence nucleotidique 
selon Tinvention, d'un polypeptide selon Tinvention, d'un anticorps selon 
Tinvention, d'une cellule selon Tinvention, et/ou d'un animal transforme selon 
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I'invention, pour la selection de compose organique ou inorganique capable de 
moduler, de reguler, d'induire ou d'inhiber i'expression de genes, et/ou de modifier 
la replication cellulaire de cellules eucaryotes ou procaryotes ou capables d'induire, 
d'inhiber ou d'aggraver les pathologies liees a une infection par Listeria 
5 monocytogenes ou un de ses micro-organismes associes. 

L' invention comprend egalement une methode de selection de composes 
capables de se Her a un polypeptide ou un de ses fragments selon I'invention, 
capables de se lier a une sequence nucleotidique selon I'invention, ou capable de 
reconnaitre un anticorps selon la revendication, et/ou capables de moduler, de 

10 reguler, d'induire ou d'inhiber I'expression de genes, et/ou de modifier la 
croissance ou la replication cellulaire de cellules eucaryotes ou procaryotes, ou 
capables d'induire, d'inhiber ou d'aggraver chez un organisme animal ou humain 
les pathologies liees a une infection par Listeria monocytogenes, ou un de ses 
micro-organismes associes, caracterisee en ce qu'elle comprend les etapes 

1 5 suivantes : 

a) mise en contact dudit compose avec ledit polypeptide, ladite sequence 
nucleotidique, avec une cellule transformee selon I'invention et/ou 
administration dudit compose a un animal transforme selon I'invention; 

b) determination de la capacite dudit compose a se lier avec ledit 
polypeptide ou ladite sequence nucleotidique, ou de moduler, de r6guler, 
d'induire ou d'inhiber I'expression de genes, ou de moduler la 
croissance ou la replication cellulaire, ou d'induire, d'inhiber ou 
d'aggraver chez ledit animal transform^ les pathologies liees k une 
infection par Listeria monocytogenes ou un de ses micro-organismes 
associes. 

Les cellules et/ou les animaux transformes selon I'invention, pourront 
avantageusement servir de modele et etre utilises dans des procedes pour etudier, 
identifier et/ou selectionner des composes susceptibles d'etre responsables de 
pathologies induites ou aggravees par Listeria monocytogenes, ou susceptibles de 
pr6venir et/ou de traiter ces pathologies comme par exemple les maladies g&iitales, 
oculaires ou systemiques, notamment du systeme lymphatique. En particulier, les 
cellules hotes transform&s, notamment les bacteries de la famille des Listerias dont 
la transformation par un vecteur selon I'invention peut par exemple accroitre ou 
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inhiber son pouvoir infectieux, ou moduler les pathologies habituellement induites 
ou aggravees par l'infection, pourront etre utilisees pour infecter des animaux dont 
on suivra Papparition des pathologies. Ces animaux non transformes, infectes par 
exemple avec des bacteries Listerias transformees, pourront servir de modele 
5 d'etude. De la meme maniere, les animaux transformes selon Pinvention pourront 
etre utilises dans des procedes de selection de composes susceptibles de prevenir 
et/ou de traiter les maladies dues k Listeria, Lesdits procedes utilisant lesdites 
cellules transformees et/ou animaux transformes, font partie de 1' invention. 

Les composes susceptibles d'etre selectionnes peuvent etre des composes 
10 organiques tels que des polypeptides ou hydrates de carbone ou tous autres 
composes organiques ou inorganiques d6ja connus, ou des composes organiques 
nouveaux elabores k partir de techniques de modelisation moleculaire et obtenus par 
synthese chimique ou biochimique, ces techniques etant connues de l'homme de 
I'art 

15 Lesdits composes selectionnes pourront etre utilises pour moduler la 

croissance et/ou la replication cellulaire de Listeria monocytogenes ou tout autre 
micro-organisme associe et ainsi pour controler I' infection par ces micro- 
organismes. Lesdits composes selon T invention pourront egalement etre utilises 
pour moduler la croissance et/ou la replication cellulaire de toutes cellules 

20 eucaryotes ou procaryotes, notamment les cellules tumorales et les micro- 
organismes infectieux, pour lesquelles lesdits composes s'avereront actifs, les 
methodes permettant de determiner lesdites modulations etant bien connues de 
Thomme de Tart. 

On entend designer par compose capable de moduler la croissance d'un 
25 micro-organisme tout compose permettant de d'intervenir, de modifier, de limiter 
et/ou de reduire le developpement, la croissance, la vitesse de proliferation et/ou la 
viabilite dudit micro-organisme, 

Cette modulation peut etre realisee par exemple par un agent capable de se 
lier a une proteine et ainsi d'inhiber ou de potentialiser son activity biologique, ou 
30 capable de se lier a une prot&ne membranaire de la surface exterieure d'un micro- 
organisme et de bloquer la penetration dudit micro-organisme dans la cellule hote 
ou de favoriser Taction du systeme immunitaire de Torganisme infecte dirigS a 
l'encontre dudit micro-organisme. Cette modulation peut 6tre egalement realisee par 
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un agent capable de se lier a une sequence nucleotidique d'un ADN ou ARN d'un 
micro-organisme et de bloquer par exemple Texpression d'un polypeptide dont 
I'activite biologique ou structurelle est necessaire k la croissance ou k la 
reproduction dudit micro-organisme. 
5 On entend designer par micro-organisme associe dans la presente invention, 

tout micro-organisme dont l'expression de gene peut etre modulee, regulee, induite 
ou inhib6e, ou dont la croissance ou la replication cellulaire peut etre egalement 
modulee par un compose de Invention. On entend designer egalement par micro- 
organisme associe dans la presente invention, tout micro-organisme comportant des 

10 sequences nucleotidiques ou des polypeptides selon l'invention. Ces micro- 
organismes peuvent dans certains cas comporter des polypeptides ou des sequences 
nucleotidiques identiques ou homologues a celles de l'invention pourront Egalement 
etre detectes et/ou identifies par les procedes ou kit de detection et/ou 
d'identification selon l'invention et egalement servir de cible pour les composes de 

15 l'invention. 

L'invention concerne les composes susceptibles d'etre selectionnes par une 
methode de selection selon l'invention. 

L'invention concerne egalement une composition pharmaceutique 
comprenant un compose choisi parmi les composes suivants : 
20 a) une sequence nucleotidique selon l'invention ; 

b) un polypeptide selon l'invention ; 

c) un vecteur selon l'invention ; 

d) un anticorps selon l'invention ; et 

e) un compose susceptible d'etre selectionne par une methode de selection 
25 selon l'invention, eventuellement en association avec un vehicule 

pharmaceutiquement acceptable. 
On entend designer par quantity efficace, une quantite suffisante dudit 
compose ou anticorps, ou de polypeptide de l'invention, permettant de moduler la 
croissance de Listeria monocytogenes ou d'un micro-organisme associe. 
JO L'invention concerne aussi une composition pharmaceutique selon 

l'invention pour la prevention ou le traitement d'une infection par une bacterie 
appartenant k I'espece Listeria monocytogenes ou par un micro-organisme associe. 
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L'invention vise en outre une composition immunogene et/ou vaccinaie, 
caract&isee en ce qu'elle comprend un ou plusieurs polypeptides selon l'invention 
et/ou un ou plusieurs polypeptides hybrides selon F invention. 

L'invention comprend aussi l'utilisation d'une cellule transformee selon 
5 l'invention, pour la preparation d'une composition vaccinaie. 

L'invention vise egalement une composition vaccinaie, caracterisee en ce 
qu'elle contient une sequence nucleotidique selon l'invention, un vecteur selon 
l'invention et/ou une cellule transformee selon l'invention. 

L'invention concerne egalement les compositions vaccinales selon 
10 l'invention, pour la prevention ou le traitement d'une infection par une bacterie 
appartenant a l'espece Listeria monocytogenes ou par un micro-organisme associe. 

De maniere preferee, les compositions immunogenes et/ou vaccinales selon 
Tinvention destinees a la prevention et/ou au traitement d'infection par Listeria 
monocytogenes ou par un micro-organisme associe seront choisies parmi les 
1 5 compositions immunogenes et/ou vaccinales comprenant un polypeptide ou un de 
ses fragments correspondant i une proteine, ou un de ses fragments, de l'enveloppe 
cellulaire de Listeria monocytogenes. Les compositions vaccinales comprenant des 
sequences nucleotidiques comprendront de preference egalement des sequences 
nucleotidiques codant pour un polypeptide ou un de ses fragments correspondant a 
20 une proteine, ou un de ses fragments, de l'enveloppe cellulaire de Listeria 
monocytogenes. 

Parmi ces compositions immunogenes et/ou vaccinales preferees, les plus 
preferees sont celles comprenant un polypeptide ou un de ses fragments, ou une 
sequence nucleotidique ou un de ses fragments dont les sequences sont choisies 
25 parmi les sequences nucleotidiques ou d'acides amin6s identifies dans ce groupe 
fonctionnel et listees pr&^demment. 

Les polypeptides de Tinvention ou leurs fragments entrant dans les 
compositions immunogenes selon Tinvention peuvent etre selectionnes par des 
30 techniques connues de Thomme de l'art comme par exemple sur la capacity desdits 
polypeptides a stimuler les cellules T, qui se traduit par exemple par Ieur 
proliferation ou la secretion d'interleukines, et qui aboutit a la production 
d'anticorps diriges contre lesdits polypeptides. 
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Chez la souris, chez laquelle une dose ponderale de la composirion vaccinale 
comparable a la dose utilisee chez Phomme est administree, la reaction anticorps est 
testee par prelevement du serum suivi d'une etude de la formation d'un complexe 
entre Ies anticorps presents dans le serum et Pantigene de la composition vaccinale, 
5 selon les techniques usuelles. 

Selon Pinvention, lesdites compositions vaccinates seront de preference en 
association avec un vehicule pharmaceutiquement acceptable et, le cas echeant, 
avec un ou plusieurs adjuvants de Pimmunite appropries. 

Aujourd'hui, divers types de vaccins sont disponibles pour proteger 

10 Phomme contre des maladies infectieuses : micro-organismes vivants attenues (M 
bovis - BCG pour la tuberculose), micro-organismes inactives (virus de la grippe), 
des extraits acellulaires (Bordetella pertussis pour la coqueluche), proteines 
recombinees (antigene de surface du virus de Phepatite B), des polyosides 
(pneumocoques). Des vaccins prepares a partir de peptides de synthese ou de micro- 

1 5 organismes gen&iquement modifies exprimant des antigenes heterologues sont en 
cours d'experimentation. Plus recemment encore, des ADNs plasmidiques 
recombines portant des genes codant pour des antigenes protecteurs ont ete 
proposes comme strategic vaccinale alternative. Ce type de vaccination est realise 
avec un plasmide particulier dfrivant d'un plasmide de K coli qui ne se replique pas 

20 in vivo et qui code uniquement pour la proteine vaccinante. Des animaux ont ete 
immunises en injectant simplement PADN plasmidique nu dans le muscle. Cette 
technique conduit k Pexpression de la proline vaccinale in situ et a une reponse 
immunitaire de type cellulaire (CTL) et de type humoral (anticorps). Cette double 
induction de la reponse immunitaire est Pun des principaux avantages de la 

25 technique de vaccination avec de PADN nu. 

Les compositions vaccinales comprenant des sequences nucleotidiques ou 
des vecteurs dans lesquels sont inserees lesdites sequences, sont notamment decrites 
dans la demande internationale N° WO 90/11092 et egalement dans la demande 
internationale N° WO 95/1 1307. 

30 La sequence nucleotidique constitutive de la composition vaccinale selon 

Pinvention peut etre injectee 4 Phote aprds avoir ete couplee 4 des composes qui 
favorisent la penetration de ce polynucleotide a Pinterieur de la cellule ou son 
transport jusqu'au noyau cellulaire. Les conjugues resultants peuvent etre 
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encapsutes dans des microparticules polymeres, comrae decrit dans la demande 
internationale N° WO 94/27238 (Medisorb Technologies International). 

Selon un autre mode de realisation de la composition vaccinale selon 
Pinvention, la sequence nuclfotidique, de preference un ADN, est complexee avec 
5 du DEAE-dextran, avec des proteines nucleaires, avec des lip ides ou encapsulee 
dans des liposomes ou encore introduite sous la forme d'un gel facilitant sa 
transfection dans les cellules. Le polynucleotide ou le vecteur selon l'invention peut 
aussi etre en suspension dans une solution tampon ou etre associe a des liposomes. 
Avantageusement, un tel vaccin sera prepare conformement a la technique 

10 decrite par Tacson et al. ou Huygen et al. en 1996 ou encore conformement a la 
technique decrite par Davis et al. dans la demande internationale N° WO 95/1 1307. 

Un tel vaccin peut etre egalement prepare sous la forme d'une composition 
contenant un vecteur selon Pinvention, placee sous le contrSle d' Elements de 
regulation permettant son expression chez Phomme ou Panimal. On pourra par 

15 exemple utiliser, en tant que vecteur d' expression in vivo de Pantigene 
polypeptidique d'interet, le plasmide pcDNA3 ou le plasmide pcDNAl/neo, tous les 
deux commercialises par Invitrogen (R&D Systems, Abingdon, Royaume-Uni). 
Un tel vaccin comprendra avantageusement, outre le vecteur recombinant, une 
solution saline, par exemple une solution de chlorure de sodium. 

20 On entend designer par vehicule pharmaceutiquement acceptable, un 

compose ou une combinaison de composes entrant dans une composition 
pharmaceutique ou vaccinale ne provoquant pas de reactions secondares et qui 
permet par exemple la facilitation de l'administration du compose actif, 
Paugmentation de sa duree de vie et/ou de son efficacite dans l'organisme, 

25 Paugmentation de sa solubilite en solution ou encore Pamelioration de sa 
conservation. Ces vehicules pharmaceutiquement acceptables sont bien connus et 
seront adaptes par Thomme de Tart en fonction de la nature et du mode 
d'administration du compose actif choisi. 

En ce qui concerne les formulations vaccinates, celles-ci peuvent 

30 comprendre des adjuvants de Timmunite appropries qui sont connus de Thomme de 
Part, comme par exemple 1'hydroxyde d'aluminium, un representant de la famille 
des muramyl peptides comme un des derives peptidiques du N-acetyl-muramyl, un 
lysat bact^rien, ou encore Padjuvant incomplet de Freund. 
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De preference, ces composes seront administres par voie systemique, en 
particulier par voie intraveineuse, par voie intramusculaire, intradermique ou sous- 
cutanee, ou par voie orale. De maniere plus preferee, la composition vaccinale 
comprenant des polypeptides selon l'invention, sera administree a plusieurs 
5 reprises, de maniere etalee dans le temps, par voie intradermique ou sous-cutanee. 

Leurs modes d'administration, posologies et formes galeniques optimaux 
peuvent etre determines selon les criteres generalement pris en compte dans 
Petablissement d'un traitement adapte a un patient comme par exemple Fige ou le 
poids corporel du patient, la gravite de son etat general, la tolerance au traitement et 
1 0 les effets secondares constates. 

L'invention comprend Putilisation d'une composition selon l'invention, 
pour le traitement ou la prevention de maladies genitales, induites ou aggravees par 
Listeria monocytogenes. 

Enfin, l'invention comprend Putilisation d'une composition selon 
15 l'invention, pour le traitement ou la prevention de maladiesinduites ou aggravees 
par la presence de Listeria monocytogenes. 

Enfin, Pinvention comprend Putilisation d'une composition selon 
l'invention, pour le traitement ou la prevention de maladies systemiques, 
notamment du systeme lymphatique, induites ou aggravees par la presence de 
20 Listeria monocytogenes, 

Par ailleurs, la presente invention a egalement pour objet une banque 
d'ADN genomique d'une bacterie du genre Listeria, de maniere preferee, Listeria 
monocytogenes, de mantere preferee la souche EGD-e, ladite banque d'ADN etant 
clonee dans des chromosomes artificiels de bacteries (BAC). Une telle banque 
25 d'ADN genomique contient de tr£s larges inserts du genome de Listeria, en 
particulier des inserts d'une taille comprise entre 50 et 200 kb. 

Un des avantages de Putilisation du systeme de BAC par rapport a un 
systeme de cosmides est que le plasmide utilise n'est present qu'a une ou au 
maximum deux copies par cellule transformee, ce qui reduit le potentiel pour la 
30 recombinaison entre des fragments d'ADN, et de fa9on plus importante, ce qui 
^limine le risque de surexpression letale de genes clones bacteriens. Neanmoins, la 
presence du BAC en tant que simple copie signifie que P ADN plasmidique doit etre 
extrait d'un large volume de culture afin d'obtenir suffisamment d'ADN pour la 
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sequence. De plus, la stabilite et la fidelite de maintenance des clones dans une 
banque de BAC permettent Tidentification de differences genomiques parmi 
differentes souches de listeria, et Identification de ces differences genetiques qui 
peuvent etre responsables des variations phenotypiques observees entre les 
5 differentes souches. 

La banque d'ADN genomique decrite dans la presente invention, en 
particulier la banque LM_baclim deposee k la CNCM le 1 1 avril 2000 sous le 
numero d'ordre n°I-2439 recouvre en effet le genome de Listeria monocytogenes. 
Toutefois, bien que certaines regions n'aient pas pu Stre donees dans ladite banque, 
10 en raison de problemes de letalites chez Escherichia coli, ces regions peuvent 
facilement 6tre amplifiees et identifies par l'homme du metier, en utilisant des 
oligonucleotides specifiques des sequences des extremites des differents clones qui 
forment les contigs. 

La presente invention concerne egalement les methodes pour l'isolement 
15 d'un polynucleotide d'interet present chez une souche de Listeria et absente chez 
une autre souche, qui utilise au moins une banque d'ADN basee sur un BAC, 
contenant le genome de Listeria. La methode selon Finvention pour Fisolement 
d'un polynucleotide d'interet peut comprendre les etapes suivantes : 

a) isoler au moins un polynucleotide contenu dans un clone de la banque 
20 d'ADN basee sur un BAC, d'origine de listeria, 

b) isoler : 

- au moins un polynucleotide genomique ou ADNc d'une listeria, ladite 
listeria appartenant a une souche differente de la souche utilisee pour la 
construction de la banque d'ADN BAC de l'etape a) ou, de fa?on 

25 alternative, 

- au moins un polynucleotide contenu dans un clone d'une banque d'ADN 
basee sur un BAC prepare a partir du genome d'une listeria qui est 
differente de la listeria utilisee pour la construction de la banque d'ADN 
basee sur le BAC de l'etape a). 

30 c) hybrider le polynucleotide de l'etape a) au polynucleotide de l'etape b) ; 

d) selectionner les polynucleotides de l'etape a) qui n'ont pas form6 de 
complexe d'hybridation avec les polynucleotides de l'etape b) ; 

e) caracteriser le polynucleotide selectionne. 
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On peut preparer le polynucleotide de Petape a) par la digestion d'au moins 
un clone recombinant BAC avec une enzyme de restriction appropriee, et de fa?on 
optionnelle, ^amplification de l'insert polynucleotide qui en resulte. 

Ainsi, la methode de l'invention permet a l'homme du metier d'effectuer des 
5 etudes genomiques comparatives entre les differentes souches ou especes du genre 
listeria, par exemple entre les souches pathogeniques et leurs equivalant non 
pathog&ies. 

En particulier, il est possible d'etudier et de determiner les regions de 
polymorphisme entre lesdites souches. 

10 

EXEMPLES 

Exemple 1 : Fabrication de la banque BAC d'ADN At Listeria monocytogenes 

Les blocs contenant 1'ADN chromosomique de Listeria monocytogenes, 
d'un poids moyen de 80 mg ont ete prepares dans V agarose, par des methodes 
15 connues de l'homme du metier. lis ont ete gardes dans une solution de 500 mM 
EDTA, pH8,0. 

On utilise 8 blocs pour la construction de la banque. Ces blocs d'agarose 
sont laves deux fois pendant 30 min sur de la glace dans du TE/PMSF (10 mM Tris- 
HC1, pH8,0 ; 1 mM EDTA ; 100 ^M PMSF). Puis, on lave les blocs d'agarose deux 

20 fois pendant 30 min sur de la glace dans du TE (10 mM Tris-HCI, pH8,0 ; 1 mM 
EDTA). Chaque bloc d'agarose est coupe en huit fines tranches, et seize tranches 
sont mises ensemble dans un seul tube. Les blocs sont alors incub6s dans une 
solution de pre-incubation (20 jxM Spermidine, 5 mM DTT, tampon de restriction 
lx), pendant 40 min sur de la glace. On change le tampon de pre-incubation une 

25 fois et on Pincube de nouveau pendant 40 min. 

Les digestions partielles sont effectuees dans un tampon de digestion (2 mM 
Spermidine, 0,5 mM DTT, 0,02 \xg BSA, tampon de restriction 0,5x) pendant 30 
min k 4°C, sans enzyme de restriction, puis on ajoute pendant 2 h & 37°C, 0,1 ; 
0,25 ; 0,4 ou 0,5 d'unite d'EcoRI (life Tech) par tube. La digestion partielle est 

30 arretee par remplacement du tampon de digestion par 200 (il EDTA 0,5 M, les tubes 
&ant places sur de la glace. 
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Les blocs d'agarose sont alors places sur un gel d' agarose 1 % SeaKem 
GTG (FMC), 0,5x tampon TBE. On effectue alors le gel d'&ectrophorese en 
champs pulse avec les conditions suivantes : 

temps de pulsation initial : 90 secondes, 
5 temps de pulsation final : 90 secondes, 

6 V/cm, 

angle inclus : 120 °, 
dureedegel: 16 h, 12°C. 

On coupe la region comprise entre 50 et 200 kb & partir du gel, et on la 
10 separe en trois morceaux (50-100 kb, 100-150 kb, 150-200 kb). Le gel pour le 
clonage ne doit pas etreteinte avec le bromure d'etidium. 

Ces blocs d'agarose sont places dans un nouveau gel d'agarose 1 % SeaKem 
GTG (FMC), 0,5x tampon TBE. On effectue un nouveau gel d'SIectrophorese en 
champs pulse avec les conditions suivantes : 
1 5 temps initial de pulsation : 5 secondes, 

temps final de pulsation : 5 secondes, 
4 V/cm, 

angle inclus : 120°, 

dureedegel: 15 h, 12°C. 
20 On decoupe de nouveau, sans avoir teinte 1'ADN avec du bromure 

d'etidium, la region comprise entre 50 et 200 kb et on la separe en trois morceaux 
(50-100 kb, 100-150 kb, 150-200 kb). 

Les morceaux d'agarose sont coupes en petits morceaux d'environ 100 mg 

chacun. 

25 L'agarose est incube a 67°C pendant 10 min puis refroidi a 42°C, et on 

ajoute 1 |al de Beta-Agarase (FMC, lU/pd). On incube pendant 30 min & 42°C, et on 
effectue une denaturation de la Beta-Agarase apr6s digestion complete de Pagarose 
par incubation pendant 10 min a 67°C, puis incubation sur glace. 

150 ng de vecteurs d'ADN dig6r6s par EcoRI et des phophoryles (CIP, 

30 Roche) sont utilises pour la construction de la banque de B AC. On utilise le vecteur 
pBelaBAC-Kan (Mozo et aL, Mol. Gen. Genet., 1998, 258, 562-70) pour la 
construction de Iadite banque. On incube done 150 ng dudit vecteur avec 150 ng 
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d' inserts d'ADN, dans un tampon de ligation lx avec 5 unites de Ligase (USB 1 
unite/fal) pendant 16 h a 12°C. 

Le tampon de ligation est le tampon recommande par le fabricant. 
La transformation est effectuee en ajoutant 5 |al de la reaction de ligation a 
5 40 |il de cellules electrocompetentes DH10B, et l'electroporation est effectuee dans 
un electroporateur Life Tech avec les conditions suivantes : 

- 330 piF, 4 kfl, DC Volts :low£2, taux de charge : rapide, cuvette 

d'electroporation 1,5 mm. 
On ajoute 1 mm de milieu de culture SOC aux cellules et on incube pendant 
10 45 min a 37°C. On etale ensuite les cellules sur une boite contenant du LB agar et 
qui contient egalement 30 g/ml de kanamicine, 100 p.g/ml d'DPTG et 40 ng/ml d'X- 
Gal. On selectionne les cellules recombinantes qui sont blanches par rapport aux 
cellules bleues, et on caracterise la banque de BAC en effectuant des cartes de 
restriction de chacun des clones ainsi obtenus. 
15 Exemple 2: Annotations et analyse de la sequence genomique de Listeria 
monocytogenes 

La bio-informatique a un role cle dans les trois phases d'un projet genome : 
suivi des inserts r6alis6s par la methode de sequen?age aleatoire, phase de finition 
de la sequence genome, et annotations. Les Inventeurs ont developpe un logiciel 

20 complet qui permet de repondre a ces trois exigences : GMP-Tool-box (GMPTB). 

Suivi : Durant la procedure de sequen9age aleatoire, GMPTB extrait des 
fichiers resultats (format Phrap) toutes les caracteristiques necessaires a 
l'assemblage (nombre de contigs, nombre de sequences...) et les affiche dans un 
tableau. Ce tableau peut etre utilise pour creer des graphiques qui montrent la 

25 progression de cette methode et qui permettent une identification rapide des 
problemes d'assemblage. De fa^on importante, GMPTB permet la comparaison 
entre les resultats d' assemblage et cree une page HTML afin de decrire la relation 
entre les nouveaux et les anciens contigs (fusion, creation.,.). 

Phase de finition de la sequence : DifFerentes strategies sont utilisees par 

30 GMPTB pour pr^dire les liaisons entre les contigs. GMPTB recherche en particulier 
tous les clones permettant les liaisons, sur la base de la localisation et de 
l'orientation des sequences terminales. II peut aussi indiquer les misassemblages. 
GMPTB peut aussi predire les liaisons, sur la base de comparaisons de genomes, en 
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recherchant les similarites entre les extremites des contigs et d'autres sequences 
genomiques (haut niveau nucleotidique et des acides amines). 

Annotation s : GMPTB permet de commencer P annotation durant la phase 
terminale. De fait, GMPTB cree un fichier individuel de prolines (IPF), pour 
5 chaque phase ouverte de lecture (ORF) lors de Passemblage. Ce sont des fichiers 
texte, dans un format specifique qui contient trois categories de champs : 

- les champs minimums contiennent un num&o d' identification, un 
numero de version, la localisation et les sequences. La sequence 
nucleotidique exportee correspond a la sequence de la phase ouverte de 

1 0 lecture avec 500 bases additionnelles avant le premier codon stop et 200 

bases additionnelles apres le second stop. 

- le champ automatique contient des resultats ajoutes par differents 
programmes a PDPF. II concerne la sequence d'ADN (recherche de sites 
de liaisons de ribosomes, promoteurs ou terminateurs, capacite 

1 5 codante. . . ) et la sequence proteique predite (homologie, domaine. . . ). 

- le champ manuel contient des resultats et des commentaires ajoutes par 
les utilisateurs. Apr6s un nouvel assemblage, GMPTB extrait toutes les 
ORF et cree de nouveaux IPF selon les sequences IPF derives des 
assemblages precedents. GMPTB reconnaTt les IPF modifies, qui sont 

20 les seuls utilises pour une nouvelle analyse automatique aprfes chaque 

assemblage. 

La specificite de cette strategie est que Pannotation des IPF est independante 
de Petape de Passemblage, a moins que sa sequence ne soit modifiee. Les IPF sont 
relies a une banque de donnee g^nomique Sybases (du modele SubtiList) et sont 
25 accessibles grace a Pintermediaire d'un serveur web. lis peuvent etre modifies et 
annot^s par differents Inventeurs durant la phase de projet de genome. 
Depot de materiel biologique 

Les organismes suivants ont ete deposes le 11 Avril 2000 a la Collection 
Nationale de Cultures de Microorganismes (CNCM), 25 rue du Docteur Roux, 
30 75724 Paris Cedex 15, France, selon les dispositions du Traite de Budapest : 

- Souche de Listeria monocytogenes EGD-e, Numero d'ordre 1-2440 ; 

- Banque BAG d'ADN de Listeria (145 clones), LM-baclib, NumSro d'ordre I- 
2439. Ladite banque BAC (1-2439) a ete realisee dans la souche Kcoli DH10B 
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(Grant et aL, PNAS, 87, 4645, 1990), construite apres digestion partielle de 
PADN de Listeria monocytogenes par 1' enzyme EcoRI dans le vecteur 
pBeiaB AC-Kan (Mozo et aL, Mol. Gen. Genet., 1998, 258, 562-70). 



5 TABLEAU 1 



SeqID Nom Localisationde la sequence 

nucl6ique de I'ORF sur la 
SEQ ID N° 1 

SeqID n° 2 LM-1000.1 From 589066 to 589362 



SeqID n° 3 LM-1 002. 1 From 587012 to 589033 

SeqID n° 4 LM-1050.1 From 2907153 to 2909708 

SeqID n° 5 LM-1 179.1 From 761552 to 763468 

SeqID n° 6 LM-1 18.2 From 2787416 to 2788363 

SeqID n° 7 LM-1235.1 From 875721 to 881855 

SeqID n p 8 LM-1248.1 From 865530 From 865530 

SeqID n° 9 LM-1248.1 From 865530 From 865530 

SeqID n° 10 LM-1305.1 From 919020 to 920408 

SeqID n° 1 1 LM-1490.1 From 649864 to 651633 

SeqID n° 12 LM-1514.1 From 664242 to 668990 



SeqID n° 13 LM-1660.3 From 501312 to 501617 
SeqID n° 14 LM-1738.1 From 547520 to 549337 

SeqID n° 15 LM-1756.2 From 2679599 to 2681125 



SeqID n° 16 LM-1778.1 From 1442368 to 1443687 



SeqID n° 17 LM-1972.3 From 1313654 to 1315435 



Fonction 



Unknown, pepdidoglycan 
bound protein (LPXTG 
motif) 
Unknown, similar to 
internalin protein 
Unknown, LPXTG protein 
with LRR repeats 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknown, peptidoglycan 
anchored protein (LPXTG 
motif) 

Unknown, surface protein 
(LPXTG motif) 

unknown, surface protein 
(LPXTG motif) 

unknown, surface protein 
(LPXTG motif) 

Unknown, similar to wall 
associated protein 

precursor (LPXTG motif) 
Unknown, similar to 
internalin proteins 

Unknown, peptidoglycan 

bound protein (LPXTG 

motif) similar to adhesin 
Unknown 
Unknown, similar to 
internalin proteins 
Unknown, surface protein 

(GW repeat) similar to N- 
acetylmuramidase 
Unknown, putative 
peptidoglycan bound 

protein (LPXTG motif) 
Unknown, similar to 
internalin proteins 
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SeqID n° 18 


■ ■ « A A J\ A^\_ 

LM-1 974.3 


From 1315767 


to 1317563 


Unknown, similar to 
internalin proteins 


SeqID n° 19 


LM-2137.2 


From 360936 


to 366272 


Unknown, similar to 
internalin proteins 


SeqID n° 20 


. LM-229.1 


From 1170002 


to 1171621 


Unknown, putative 
interanlin, similar to InIA 


SeqID n° 21 


LM-2323.1 


From 344850 


to 346049 


Unknown, similar to 
surface proteins 


SeqID n° 22 


LM-2435.1 


From 159663 


to 161378 


unknwon, surface 
anchored protein (LPXTG 


SeqID n° 23 








motif) 


LM-2438.1 


From 157089 


to 159470 


unknwon, surface 
anchored protein (LPXTG 
motif) 


SeqID n° 24 


LM-2503.1 


From 2108500 


to 2109603 


Unknown, putative cell 
surface protein, similar to 
internalin proteins 


SeqID n° 25 


LM-2504.1 


From 2106329 


to 2108209 


Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 


SeqID n° 26 


LM-3009.3 


From 1149887 


to 1152475 


unknown, similar to 
fibrinogen-binding protein 
(LPXTG motif) 


SeqID n° 27 


LM-311.2 


From 131419 


to 133773 


Unknwon, similar to 5 - 
nucleotidase (LPXTG 
motif) 


SeqID n° 28 


LM-3144.1 


From 351459 


to 355505 


Unknown, similar to cell 
surface proteins (LPXTG 
motif) 


SeqID n° 29 


LM-3369.1 


From 1869532 


to 1872243 


Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 


SeqID n° 30 


LM-3418.2 


From 1717193 


to 1722328 


unknown, peptidoglycan 
linked protein (LPxTG) 


SeqID n 6 31 


LM-3477.1 


From 828168 


to 830108 


Unknown, similar to 
internalin 


SeqID n° 32 


LM-3609.1 


From 1106041 


to 1107759 


unknown, similar to 
AUTOLYSIN(EC 3.5.1.28) 
(N-ACETYLMURAMOYL- 
L-ALANINE AMIDASE) 


SeqID n 33 


LM-3691.2 


From 2162323 


to 2164011 


Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 


SeqID n° 34 


LM-3700.2 


From 2653211 


to 2657803 


Unknwon, peptidoglycan 
anchored protein (LPXTG 
motif) 


SeqID n° 35 


LM-3752.3 


From 357775 


to 359676 


Unknown, similar to 
internalin 


SeqID n° 36 


LM-757.1 


From 2544267 


to 2545433 


Unknown, similar to 
internalin proteins 
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LM-814.2 


From 2264772 


to 2268230 


SeqID n° 38 


LM-816.2 


From 2259753 


to 2264591 


SeqID n 39 


LM-894.1 


From 2469093 


to 2471915 


SeqID n° 40 


LM-966.1 


From 173309 


to 174556 


SeqID n° 41 


LM-973.1 


From 169510 


to 172008 


SeqID n° 42 


LM-133.1 


From 1230773 


to 1232308 


SeqID n° 43 


LM-134.1 


From 1229967 


to 1230773 


SeqID n° 44 


LM-135.1 


From 1229277 


to 1229954 


SeqID n° 45 


LM-136.1 


From 1228994 


to 1229290 


SeqID n° 46 


LM-137.1 


From 1228263 


to 1228997 


Sea ID n° 47 


LM-138 1 


From 1227556 


to 


SeqID n° 48 


LM-139.1 


From 1226778 


to 1227563 


SeqID n° 49 


LM-141.1 


From 1225300 


to 1226781 


SeqID n° 50 


LM-142.1 


From 1224548 


to 1225300 


SeqID n 51 


1 R A A A *S A 

LM-143.1 


From 1223826 


to 1224551 


SeqID n° 52 


LM-144.1 


From 1222798 


to 1223829 


SeqID n° 53 


LM-145.1 


From 1222062 


to 1222811 


SeqID n° 54 


LM-146.1 


From 1221487 


to 1222056 


SeqID n° 55 


LM-147.1 


From 1220901 


to 1221497 



Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknown, putative 
peptidoglycan bound 
protein (LPXTG motif) 
Unknown, similar to 
internalin proteins 
Unknown, cell wall 
anchored protein 
Unknwon, similar to 
internalin proteins 
Unknown, similar to cobyric 

acid synthase CbiP 
Unknown, similar to cobalt 
transport ATP-binding 
protein CbiO 
Unknown, similar to cobalt 
transport protein Q 
Unknown, similar to 
putative cobalt transport 
protein CbiN 
Unknown, similar to 
cobalamin biosynthesis 

protein M 
Unknown, similar to S- 
adenosyl-methionine: 
precorrin-2 
methyltransferase 
unknown, similar to 
anaerobic Cobalt 
Chelatase In Cobalamin 
Biosynthesis 
Unknown, similar 
uroporphyrinogen-III 
methyltransferase/uroporph 
yrinogen-lll synthase 

Unknown, similar to 
cobalamin biosynthesis J 
protein CbiJ 
Unknown, similar to 
precorrin methylase 
Unknown, similar to 
cobalamin biosynthesis 
protein G CbiG 
Unknown, similar to 
precorrin-3 methylase 
Unknown, similar to 
precorrin decarbocylase 
Unknown, similar to 
precorrin methylase 
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SeqlDn°56 LM-149.1 From 1219783 to 12209CM 

# 

SeqID n° 57 LM-150.1 From 1219135 to 1219767 

SeqID n° 58 LM-151.1 From 1218175 to 1219122 



SeqID n° 59 LM-152.1 From 1216963 to 1217322 
SeqID n° 60 LM-181.1 From 1196097 to 1197182 



SeqID n° 61 LM-207.1 From 1180152 to 1181036 

SeqID n° 62 LM-209.1 From 1179168 to 1179743 

SeqID n° 63 LM-210.1 From 1178421 to 1179167 

SeqID n° 64 LM-212.1 From 1 177862 to 1 178419 



SeqID n° 65 LM-1 .3 From 2710869 to 2712755 

SeqID n° 66 LM-10.1 From 2718312 to 2719055 

SeqID n° 67 LM-1 00. 1 From 2775644 to 2776486 

SeqID n° 68 LM-1003.1 From 586329 to 586994 

SeqID n° 69 LM-1004.1 From 585015 to 585962 



SeqID n° 70 LM-1005.1 From 583507 to 584757 



SeqID n° 71 LM-1007.1 From 583072 to 583452 
SeqID n° 72 LM-1008.1 From 582526 to 583047 



SeqID n° 73 LM-1 009.1 From 581519 to 582505 



SeqID n° 74 LM-1 01 . 1 From 2776532 to 2776777 



Unknown, similar to 
cobalamin biosynthesis 
protein CbiD 
Unknown, similar to 
precorrin isomerase 
Unknown, similar to 
cobalamine synthesis 
protein CbiB 
FALSE ORF 
unknown, similar to 
Salmonella typhimurium 
CobD protein and to 
histidinol-phosphate 
aminotransferase 
Regulatory protein similar 
to Salmonella typhimurium 

PocR protein 
unknown, similar to alpha- 
ribazole-5 -phosphatase 
unknown, highly similar to 

cobalamin (5 - 
phosphatase) synthetase 
unknown, similar to 
bifunctional cobalamin 
biosynthesis protein CopB, 

(cobinamide kinase; 
cobinamide phosphatase 

guanylyltransferase) 
Unknown, similar to NADH 
dehydrogenase 
Unknown 
Unknown, similar to 
aldo/keto reductase 

Unknown 
Unknown, similar to B. 

subtilis DeoR 
transcriptional regulator 
Unknown, similar to 
putative NAD(P)- 
dependent oxidoreductase 

Unknown 
Unknown, similar to PTS 
system, glucitol/sorbitol- 
specific enzyme II CM 
component 
Unknown, similar to PTS 
system, glucitol/sorbitol- 
specific enzyme IIBC 

component 
unknown, similar to B. 
subtilis YaaL protein 
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SeqID n° 75 


LM-1010.1 


From 581150 


to 581 500 


Unknown, similar to PTS 
system, glucitol/sorbitol- 
speciflc enzyme IIA 


SeqID n° 76 








component 


LM-1011.1 


From 580119 


to 581039 


Unknown, similar to ABC 
transporter (binding 


SeqID n° 77 








protein) 


LM-1012.1 


From 578886 


to 580079 


Unknown, similar to 


SeqID n 9 78 








penicillin-binding protein 


LM-1013.1 


From 577632 


to 578648 


Unknown, similar to 
tagatose-1 ,6-diphosphate 


SeqID n° 79 








aldolase 


LM-1015.1 


From 576403 


to 577584 


Unknown, similar to N-acyl- 
L-amino acid 


SeqID n° 80 


LM-1017.1 






amidohydrolase 


From 575166 


to 576437 


Unknown, similar to N- 
carbamyl-L-amino acid 


SeqID n° 81 








amidohydrolase 


LM-1018.1 


From 573740 


to 575062 


Unknown, similar to 6- 


SeqID n° 82 








phospho-beta-glucosidase 


LM-102.1 


From 2776792 


to 2777388 


unknown, highly similar to 
recombination protein recR 


SeqID n° 83 


LM-1 020.1 


From 572600 


to 573568 


Unknown, similar to 
transcription regulator (Lacl 


SeqID n° 84 








family) 


LM-1 022.1 


From 571204 


to 572559 


Unknown, similar to 


SeqID n° 85 








unknown proteins 


LM-1 023.1 


From 570916 


to 571185 


Unknown, similar to 


SeqID n° 86 








unknown proteins 


LM-1 024.1 


From 570008 


to 570838 


Unknown 


SeqID n° 87 


LM-1025.1 


From 569094 


to 569948 


Unknown 


SeqID n° 88 


LM-1 029.1 


From 567030 


to 569081 


Unknown 


SeqID n° 89 


LM-1 03.1 


From 2777497 


to 2777814 


Unknown, highly similar to 
B. subtilis YaaK protein 


SeqID n° 90 


LM-1 031.1 


From 565764 


to 567014 


Unknown, conserved 
hypothetical protein similar 
to putative 


SeqID n° 91 








glucosaminyltransferase 


LM-1 032.1 


From 564257 


to 565753 


Unknown, hypothetical 


SeqID n° 92 








secreted protein 


LM-1 033.1 


From 562789 


to 564264 


Unknown, transmembrane 


SeqID n° 93 








protein 


LM-1036.1 


From 561819 


to 562559 


Unknown, similar to 
transcription regulator 
(TipA from Streptomyces 


SeqID n° 94 








coelicolor) 


LM-1 037.1 


From 560443 


to 561774 


Unknown 


SeqID n° 95 


LM-1 040.2 


From 2915744 


to 2916097 


Unknown 


SeqID n° 96 


LM-1 041 .2 


From 2915219 


to 2915641 


Unknown similar to 
transcriptional regulator 
(MarR family) 
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SeqlDn°97 LM-1043.1 From 2914057 to 2915205 

SeqlDn°98 LM-1045.1 From 2912595 to 2913686 

SeqlDn°99 LM-1047.1 From 2911415 to 2912602 

SeqID n° 100 LM-1048.1 From 2910085 to 291 1326 



SeqID n° 101 LM-1049.1 From 2909753 to 2910076 
SeqID n° 102 LM-1052.1 From 2904598 to 2906940 



SeqID n° 103 LM-1054.1 From 2903200 to 2904402 

SeqID n° 104 LM-1 055.1 From 2901800 to 2903200 

SeqID n° 105 LM-1056.1 From 2900594 to 2901784 

SeqID n° 106 LM-1057.1 From 2899313 to 2900506 

SeqID n° 107 LM-1058.1 From 2898535 to 2899266 

SeqID n° 1 08 LM-1 059. 1 From 2897847 to 2898374 

SeqID n° 109 LM-1060.1 From 28971 16 to 2897823 

SeqID n° 110 LM-1061.1 From 2896241 to 2897059 

SeqID n° 111 LM-1064.1 From 2894510 to 2895883 

SeqID n° 112 LM-1066.1 From 2892596 to 2894485 

SeqID n° 1 13 LM-1067.2 From 2892057 to 2892437 

SeqID n° 114 LM-1069.3 From 434945 to 435817 

SeqID n° 1 15 LM-107.1 From 2777844 to 2779583 

SeqID n° 1 16 LM-1 070.1 From 433153 to 434745 



SeqID n° 1 17 LM-1074.1 From 429630 to 432095 

SeqID n° 118 LM-1075.1 From 428981 to 429403 

SeqID n° 119 LM-1076.1 From 428576 to 428968 

SeqID n° 120 LM-1077.1 From 428127 to 428507 

SeqID n° 121 LM-1079.1 From 427104 to428111 



Unknown, similar to efflux 

proteins 
unknown, highly similar to 
phosphoserine 
aminotransferase 
unknown, similar to D-3- 
phosphoglycerate 
dehydrogenase 
unknown, similar to an 
hypothetical protein from 
Thermotoga maritima 
unknown 
Unknown, amino-terminal 

domain similar to 
transcription regulators 
unknown, similar to 
carboxypeptidase 
Unknown, similar to 
transmembrane efflux 
protein 
unknown, similar to 

peptidases 
unknown, similar to 
transport protein 
Unknown, similar to 

reductases 
Unknown, similar to 
transcriptional regulator 

Unknown 
unknown, similar to D- 
alanyl-D-alanine 
carboxypeptidase 
Unknown, similar to 
GTPase 
unknown, highly similar to 
GidA protein 
Unknown, hypothetical 
secreted protein 
Unknown 
unknown, highly similar to 

DNA polymerase III 
(gamma and tau subunits) 
Unknown, similar to 
phosphoenolpyruvate 
synthase (N-terminal part) 
internalin 
Unknown 
Unknown 
Unknown, similar to B. 
subtilis YyaH protein 
Unknown, similar to 
phosphate transport protein 
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SeqID n° 122 

SeqID n° 124 
SeqID n° 125 


LM-108.1 
( M man 1 

LM-1081.1 
LM-1082.1 


From 2779910 

Prom A9R^Q7 

From 426190 
From 424130 


to 2780374 
to 4^osyo 
to 426606 
to 426064 


SeqID n° 126 


LM-1083.1 


From 421469 


to 424096 


SeqID n° 127 


LM-1084.1 


From 420335 


to 421447 


SeqID n° 128 


LM-1085.1 


From 419999 


to 420331 


SeqID n° 129 


LM-1088.1 


From 419544 


to 420002 


SeqID n°130 


LM-1089.1 


From 418891 


to 419373 


SeqID n°131 


LM-109.1 


From 2780933 


to 2782297 


SeqID n° 132 


LM-1 091.1 


From 417963 


to 418763 


SeqID n° 133 


LM-1 092.1 


From 417535 


to 417960 


SeqID n° 134 


LM-1093.1 


From 416796 


to 417479 


SeqID n°135 
SealD n° 136 


LM-1 094.1 
LM-1 097 1 

L.IVI 1 \J C I.I 


From 416203 
From 415994 


to 416652 

to 41fi1fift 


SeqID n° 137 
SeqID n°138 


LM-1 098.1 
LM-1 099.1 


From 414928 
From 414095 


to 415227 
to 414760 


SeqID n° 139 
SeqID n° 140 


LM-1 1.1 
LM-1 100.1 


From 2719056 
From 412849 


to 2719775 
to 413964 


SeqID n° 141 
SeqID n° 142 


LM-1 101.1 
LM-1 102.1 


From 412457 
From 411992 


to 412825 
to 412363 


SeqID n° 143 


LM-1 105.1 


From 409944 


to 41 1860 


SeqID n° 144 


LM-1 107.1 


From 408954 


to 409931 



Unknown 
Unknown 
Unknown 
Unknown, similar to 
transcriptional 
antiterminator (BgIG family) 
Unknown, highly similar to 

E. col YbgG protein, a 
putative sugar hydrolase 
Unknown, similar to 

fructose-specific 
phosphotransferase 

enzyme IIC 
Unknown, similar to 

fructose-specific 
phosphotransferase 

enzyme IIB 
Unknown, similar to 
phosphotransferase 
system enzyme HA 
Unknown, similar to 
unknown proteins 
Unknown, similar to PTS 
system, cellobiose-specific 
enzyme IIC 
Unknown, similar to 1- 
pyrroline-5-carboxylate 
reductase (ProC) 
Unknown, weakly similar to 
blasticidin S- 
acetyltransferase 
Unknown, similar to L. 

monocytogenes 
extracellular P60 protein 

Unknown 
Unknown, highly similar to 
B. subtilis YqfA protein 
Unknown 
Unknown, similar to uracil- 
DNA glycosylase 
Unknown 
Unknown, low temperature 
requirement protein A 

Unknown 
Unknown, similar to B. 
subtilis YhdG protein 
Unknown, similar to B. 
subtilis lolD protein, to 
acetolactate synthase 
Unknown, similar to B. 
subtilis lolC protein and to 
fructokinase 
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SeqID n° 145 


LM-1 108.1 


From 408116 


to 408937 


SeqID n° 146 


LM-1 11 0.2 


From 406637 


to 408103 


SeqID n° 147 


LM-1 112.2 


From 405680 


to 406441 


SeqID n° 148 
SeqID n° 149 
SeqID n° 150 


LM-1 114.1 
LM-1 115.3 
LM-1 116.4 


From 405221 
From 404365 
From 807976 


to 405607 
to 404994 
to 810792 


SeqID n° 151 


LM-1 117.1 


From 807255 


to 807689 


SeqID n° 152 


LM-1 118.1 


From 806770 


to 807255 


SeqID n° 153 
SeqID n° 154 


LM-1 12.1 
LM-1 121.1 


From 2783466 
From 805784 


to 2784107 
to 806596 


SeqID n° 155 


LM-1 122.1 


From 804887 


to 805765 


SeqID n°156 

OoqiLS n lOf 

SeqID n° 158 
ocJt|iL-' n io?7 
SeqID n° 160 


LM-1 123.1 

1 M i 10£ -i 
LIVI- 1 IZQ.l 

LM-1 126.1 

LIVI- 1 1 Z 1 . 1 

LM-1 128.1 


From 804382 
rrom oVo/hd 
From 803201 
rrom ou^ooo 
From 801336 


to 804729 
to 804212 
to 803548 
to 802 7o9 
to 802202 


SeqID n° 161 
SeqID n° 162 


LM-1 129.1 
LM-1 13.1 


From 800965 
From 2784354 


to 801297 
to 2784674 


SeqID n° 163 


LM-1 130.1 


From 799996 


to 800925 


SeqID n° 164 


LM-1 131.1 


From 798828 


to 799817 


SeqID n° 165 


LM-1 132.1 


From 798141 


to 798788 


SeqID n° 166 


LM-1 133.1 


From 797353 


to 798105 



Unknown, similar to B. 
subtilis lolB protein 
Unknown, highly similar to 
B. subtilis methylmalonate- 
semialdehyde 
dehydrogenase lolA 
Unknown, similar to B. 
subtilis transcription 
repressor of myo-inositol 
catabolism operon loIR 
Unknown 
Unknown 
Unknown, similar to 
transcriptional regulator 
(NifA/NtrC family) 
Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
MA 

Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
IIB 
Unknown 
Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
IIC 

Unknown, similar to 
mannose-specific 
phosphotransferase 
system (PTS) component 
IID 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
transcription regulator 
(repressor) 
Unknown 
unknown, similar to 
hypothetical proteins 
Unknown, conserved 
hypothetical protein 
Unknown, similar to alcohol 
dehydrogenase 
Unknown, similar to 
transcription regulator 
Unknown 
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SeqlDn°167 LM-1 134.1 From 796075 to 797193 



SeqID n° 168 LM-1 135.1 From 795034 

SeqID n e 1 69 LM-1 1 36. 1 From 793777 

SeqID n e 170 LM-1 137.1 From 792879 

SeqID n° 171 LM-1 138.1 From 791977 



SeqID n° 172 
SeqID n°173 

SeqID n° 174 

SeqID n° 175 

SeqID n° 176 

SeqID n° 177 

SeqID n° 178 
SeqID n° 179 
SeqID n° 180 
SeqID n° 181 



LM-1 139.1 
LM-1 14.1 

LM-1 140.1 

LM-1 141.1 

LM-1 142.1 

LM-1 143.1 

LM-1 144.1 
LM-1 146.1 
LM-1 147.1 
LM-1 149.1 



From 790662 
From 2784805 

From 789520 

From 788642 

From 787320 

From 786372 

From 785754 
From 784788 
From 783901 
From 782794 



to 796062 
to 795030 

to 793754 
to 792867 

to 791960 
to 2786319 

to 790509 

to 789514 

to 788576 

to 786992 

to 786362 
to 785747 
to 784788 
to 783774 



SeqID n° 182 LM-1 151.1 From 781896 to 782801 



SeqID n° 183 LM-1 152.1 From 781103 

SeqID n° 184 LM-1 153.2 From 780377 

SeqID n° 185 LM-1 154.1 From 779616 

SeqID n° 1 86 LM-1 1 55.1 From 778654 



SeqID n° 187 
SeqID n°188 
SeqID n° 189 
SeqID n°190 
SeqID n°191 
SeqID n° 192 



LM-1 156.1 
LM-1 157.1 
LM-1 158.1 
LM-1 159.1 
LM-1 160.1 
LM-1 161.1 



From 778262 
From 777692 
From 777209 
From 776761 
From 776398 
From 775744 



to 781924 
to 780988 

to 780296 

to 779490 



to 778558 
to 778207 
to 777496 
to 777102 
to 776697 
to 776247 



Unknown, similar to 
transcriptional regulator 
(Lacl family) 
Unknown, similar to alpha- 

1 ,6-mannanase 
Unknown, similar to sugar 

ABC transporter, 
periplasmic sugar-binding 

protein 
Unknown, similar to ABC 
transporter, permease 
protein 
Unknown, similar to 
putative sugar ABC 
transporter, permease 
protein 
Unknown 
Unknown, highly similar to 
gluconate kinase 
Unknown, similar to . 
lipoate-protein ligase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ATP/GTP-binding protein 
Unknown, similar to 
unknown proteins 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
hypothetical proteins 
Unknown; Similar to ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
a bile acid 7-alpha 

dehydratase 
unknown, similar to 
transcription regulator 
Crp/Fnr family 
Unknown, weakly similar to 
a putative haloacetate 
dehalogenase 
Unknown 
Unknown 
Unknown 
unknown 
Unknown, hypothetical 
Unknown 
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SeqID n° 193 


LM-1 164.1 


From 773715 


to 775715 


SeqID n° 194 
oeqiu n iyo 


LM-1 165.1 
Livi-nob.i 


From 772996 
From 772310 


to 773700 
to 772999 


SeqID n° 196 


LM-1 167.1 


From 771949 


to 772326 


SeqID n° 197 


LM-1 168.1 


From 771026 


to 771685 


SeqID n° 198 


LM-1 169.1 


From 769639 


to 771012 


SeqID n 6 199 


LM-117.1 


From 2786435 


to 2787373 


SeqID n° 200 


LM-1 171.1 


From 767766 


to 769619 


oeqiu n /ui 
SeqID n° 202 


LM-1 1 f Z.l 

LM-1 173.1 


Horn 7ooo10 
From 766351 


to 767742 
to 766797 


SeqID n° 203 


LM-1 174.1 


From 765683 


to 766354 


SeqID n° 204 


LM-1 175.1 


From 764465 


to 765469 


SeqID n° 205 


LM-1 176.1 


From 763798 


to 764307 


SeqID n° 206 
SeqID n° 207 
SealD n° 208 
SeqID n° 209 


LM-1 181.1 
LM-1 182.2 
LM-1 183 2 

I— (VI f lUv/tfc 

LM-1184.3 


From 760455 
From 759972 

From 7*>CM7Q 

II will f v/57*t/ \7 

From 2916291 


to 760859 
to 760310 

f/s 7CQQCO 

to 2916890 


Renin n° 910 


1— IVI™ 1 1 OD. 1 


Prom 


to ^yi /ZOO 


SeqID n° 211 


LM-1189.1 


From 2917389 


to 291 8051 


SeqID n° 212 
SeqID n° 213 


LM-1 19.2 
LM-1 190.1 


From 1239842 
From 2918048 


to 1240201 
to 2919169 


SeqID n° 214 


LM-1 195.1 


From 2919166 


to 2921343 


SeqID n" 215 


LM-1 196.1 


From 2921340 


to 2922371 


SeqID n° 216 


LM-1 198.1 


From 2922429 


to 2923223 



Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
transcriptional regulator 
(GntR family) 
Unknown, similar to 
putative transcription 

regulator 
Unknown, similar to 6- 
phospho-beta-glucosidase 
Unknown, secreted protein 
with 1 GW repeat 
Unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIABC component 
Unknown 
Unknown, similar to ribose 
5-phosphate isomerase 

Unknown, similar to 
Ribulose-5-Phosphate 3- 
Epimerase 
Unknown, similar to 
transcriptional regulator 
(Lad family) 
Unknown, similar to 
transcription regulator 
Unknown 
Unknown 
Unknown 
Unknown, similar to yeast 
protein Frm2p involved in 
fatty acid signaling 
Unknown, similar to 

thioredoxin 
Unknown, similar to 
phosphoglucomutase 

unknown 
unknown; similar to 
unknown proteins 
Unknown, similar to a 
maltose phosphorylase 
Unknown, similar to 
oxidoreductases 
Unknown, highly similar to 
an E. coli protein 
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SeqID n° 217 LM-1 199.1 From 2923236 to 2924288 Unknown, similar to alcohol 

dehydrogenase 

SeqID n° 218 LM-1201.1 From 2924307 to 2925158 Unknown, similar to sugar 

ABC transporter permease 
protein 

SeqID n° 219 LM-1202.1 From 2925145 to 2926026 Unknown, similar to sugar 

ABC transporter permease 
protein 

SeqID n° 220 LM-1203.1 From 29261 01 to 2927393 Unknown, similar to sugar 

binding protein (ABC 
transporter) 

SeqID n° 221 LM-1205.1 From 2927390 to 2928631 Unknown, similar to 

Sucrose phosphorylase 

SeqID n° 222 LM-1206.1 From 2928668 to 2929090 unknown, weakly similar to 

sucrose phosphorylase 

SeqID n° 223 LM-1208.1 From 2929315 to 2930340 Unknown, similar to 

transcriptional regulator 

SeqID n° 224 LM-1209.1 From 2930479 to 2932473 unknown 

SeqID n° 225 LM-121.1 From 1239039 to 1239797 unknown, similar to rRNA 

methylase 

SeqID n° 226 LM-1210.1 From 2932609 to 2933148 Unknown, similar to 

unknown proteins 

SeqID n° 227 LM-1212.1 From 2933303 to 2934715 unknown, similar to 

transmembrane efflux 
proteins 

SeqID n° 228 LWM213.1 From 2934756 to 2935070 Unknown, highly similar to 

B. subtilis YulD protein 
SeqID n° 229 LM-1214.1 From 2935083 to 2935904 unknown, highly similar to 

rhamnulose-1-phosphate 
aldolase 

SeqID n° 230 LM-1216.1 From 2935917 to 2937179 unknown, highly similar to 

L-rhamnose isomerase 
SeqID n° 231 LM-1218.1 From 2937192 to 2938643 unknown; similar to 

rhamnulokinase 

SeqID n° 232 LM-122.1 From 1237934 to 1239013 unknown, similar to endo- 

1,4-beta-gIucanase and to 
aminopeptidase 

SeqID n° 233 LM-1220.1 From 2938662 to 2939930 Unknown, similar to sugar 

transport proteins 

SeqID n° 234 LM-1221.1 From 2940053 to 2941033 Unknown, similar to AraC- 

type regulatory protein 
SeqID n° 235 LM-1222.1 From 2941083 to 2941544 unknown 
SeqID n° 236 LM-1223.2 From 2941599 to 2942219 unknown, highly similar to 

B. subtilis Jag protein 

SeqID n° 237 LM-1 224.3 From 2942216 to 2943079 Unknown, highly similar to 

B. subtilis SpollfJ protein 
SeqID n° 238 LM-1225.2 From 884170 to 885270 Unknown, similar to 

excinuclease ABC, chain C 
(UvrC) 

SeqID n° 239 LM-1228.1 From 882962 to 884065 Unknown, similar to B. 

subtilis YxjH and YxjG 
proteins 
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SeqID n° 240 LM-123. 1 From 1236836 to 1237822 

SeqID n° 241 LM-1230.1 From 882328 to 882705 

SeqID n° 242 LM-1231 .1 From 881992 to 882249 

SeqID n° 243 LM-1239.1 From 872616 to 875258 



SeqID n° 244 LM-1241.1 From 871940 to 872410 
SeqID n° 245 LM-1242.1 From 870587 to 871807 



SeqID n° 246 LM-1 243. 1 From 869095 to 870480 

SeqID n° 247 LM-1245.1 From 867163 to 868800 

SeqID n° 248 LM-1246.1 From 866577 to 866990 

SeqID n° 249 LM-1249.1 From 864791 to 865537 

SeqID n° 250 LM-125.1 From 1235670 to 1236539 

SeqID n° 251 LM-1250.1 From 863798 to 864688 

SeqID n° 252 LM-1251 .1 From 863322 to 863606 

SeqID n° 253 LM-1254.1 From 861731 to 862642 

SeqID n° 254 LM-1255.1 From 859656 to 861617 

SeqID n° 255 LM-1257.1 From 855759 to 859406 

SeqID n° 256 LM-1 258.1 From 854745 to 855581 

SeqID n° 257 LM-1259.1 From 854443 to 854748 

SeqID n° 258 LM-1261 .1 From 852704 to 854338 

SeqID n° 259 LM-1262.1 From 851225 to 852505 

SeqID n° 260 LM-1263.1 From 850216 to 851184 

SeqID n° 261 LM-1265.1 From 849314 to 850138 

SeqID n° 262 LM-1266.1 From 848804 to 849193 

SeqID n° 263 LM-1267.1 From 848126 to 848788 



unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase (autolysin) 
unknown, conserved 
hypothetical protein 
unknown, similar to B. 
subtilis protein YsdA 
unknown, similar to cation 
(clacium) transporting 
ATPase 
Unknown 
unknown, similar to 
Tetracycline resistance 
protein 
unknown, highly similar to 
hexose phosphate 
transport protein 
Unknown, ABC transporter 
(ATP binding protein) 
unknown, similar to B. 
subtilis YrkR protein 

unknown 
unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase (autolysin) 
Unknown; similar to 
transcriptional regulator 
unknown, similar to 
transposase 
unknwon 
unknown, highly similar to 
fructose-1,6- 
bisphosphatase 
unknown, highly similar to 
pyruvate-flavodoxin 

oxidoreductase 
unknown, similar to 

transposases 
unknown, similar to 

transposases 
Unknown, similar to 

transport protein 
unknown, similar to 3- 
hydroxy-3-methylglutaryl- 
coenzyme a reductase 
unknown 
unknown, similar to 
oxydoreductases 
unknown, similar to 
transcriptional regulators 
unknown 
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SeqID n° 264 


LM-1 268.1 


From 847552 


to 848109 


SeqID n° 265 
oeqiu n zoo 


LM-1269.1 
LM-1 271 .Z 


From 846594 
Prom 843949 


to 847442 
to 846579 


Onrtin r-i° OC7 

oeqiu n 20/ 


LM-IZ/o.1 


From 939106 


to 93981 9 


SeqID n° 268 


LM-1 275.2 


From 937739 


to 939025 


SeqID n° 269 
oeqiu n z/u 


LM-1 277.2 

1 IV yi 4 070 1 


From 937202 
rrom aoDiob 


to 937594 
10 9oOb03 


SeqID n° 271 


LM-128.1 


From 1233904 


to 1234434 


SeqID n° 272 


LM-1283.1 


From 933959 


to 936136 


SeqID n° 273 


LM-1 285.1 


From 932257 


to 933882 


SeqID n° 274 


LM-1286.1 


From 931451 


to 932050 


SeqID n° 275 


LM-1 287.1 


From 930671 


to 931450 


SeqID n° 276 


LM-1 288.1 


From 930220 


to 930693 


SeqID n° 277 


LM-1 289.1 


From 929892 


to 930236 


SeqID n° 278 


LM-129.1 


From 1233366 


to 1233815 


SeqID n° 279 


LM-1290.1 


From 928738 


to 929742 


oeqiu n zou 


Livi-izyi.i 


rrom y^ooii 


to 928721 


SeqID n° 281 


LM-1 293.1 


From 927952 


to 928308 


SeqID n° 282 


LM-1294.1 


From 927110 


to 927946 


SeqID n° 283 


LM-1 295.1 


From 926511 


to 926858 


SeqID n° 284 


LM-1296.1 


From 926229 


to 926507 


SeqID n° 285 


LM-1297.1 


From 924917 


to 926023 


SeqID n° 286 


LM-1298.1 


From 924542 


to 924898 



unknown, some similarity 
to acatyitransferases 
unknown 
unknown, similar to cation 
transporting ATPase 
Unknown, similar to 
transcription regulator 

(GntR family) 
unknown, similar to PTS 
system, cellobiose-specific 
IIC component 
Unknown 
unknown, similar to B. 
subtilis YdcK protein 
unknown, similar to 
unknown proteins 
unknown, conserved 
hypothetical protein 
unknown, similar to 
transport proteins 
Indirect negative regulation 
of sigma B dependant gene 
expression (serine 
phosphatase) 
RNA polymerase sigma-37 

factor (sigma-B) 
sigma-B activity negative 
regulator RsbW 
anti-anti-sigma factor 
(antagonist of RsbW) 
unknown, similar to 
unknown proteins 
unknown, highly similar to 
serine phosphatase RsbU 
unknown, highly similar to 
positive regulation of 
sigma-B activity 
unknown, highly similar to 
negative regulation of 
sigma-B activity 
unknown, highly similar to 
positive regulator of sigma- 
B activity 
Unknown, similar to B. 
subtilis YdcE protein 
Unknown, similar to B. 
subtilis YdcD protein 
Unknown, similar to alanine 

racemase 
Unknown, similar to holo- 
acyl-carrier protein 
synthase 
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SeqID n° 287 LM-13.1 From 2719735 
SeqID n° 288 LM-1 30.1 From 1232884 
SeqID n° 289 LM-1300.1 From 923161 



SeqID n° 290 

SeqID n° 291 

SeqID n° 292 
SeqID n° 293 
SeqID n° 294 



LM-1302.1 From 921537 

LM-1303.1 From 921062 

LM-1 304.1 From 920452 

LM-1306.1 From 918135 

LM-1307.1 From 917164 



to 2720484 
to 1233351 
to 924540 

to 923012 

to 921544 

to 920928 
to 918902 
to 918117 



SeqID n° 295 LM-1309.1 From 916436 to 917164 

SeqID n° 296 LM-1 31.1 From 1232272 to 1232838 

SeqID n° 297 LM-1 31 2.1 From 915090 to 916415 

SeqID n° 298 LM-1313.1 From 914735 to 915070 

SeqID n° 299 LM-1314.1 From 914405 to 914734 

SeqID n° 300 LM-1316.1 From 912372 to 914390 

SeqID n° 301 LM-1317.1 From 910998 to 912164 

SeqID n° 302 LM-1318.1 From 910517 to 910870 

SeqID n° 303 LM-1319.1 From 910173 to 910484 

SeqID n° 304 LM-1320.1 From 909159 to 910157 

SeqID n° 305 LM-1 321 .1 From 908499 to 909056 

SeqID n° 306 LM-1 322.1 From 907979 to 908467 

SeqID n° 307 LM-1 325.1 From 905962 to 907524 

SeqID n° 308 LM-1326.1 From 903837 to 905510 

SeqID n° 309 LM-1327.1 From 902773 to 903840 

SeqID n° 310 LM-1328.1 From 901837 to 902751 

SeqID n° 31 1 LM-1329.1 From 900303 to 901835 



Unknown, similar to 
creatinine amidohydrolase 
unknown, similar to 
unknown proteins 
unknown, similar to 
protoporphyrinogen IX and 
coproporphyrinogen III 

oxidase (HemY) 
Unknown, similar to B. 
subtilis YbtB protein 
unknown, similar to B. 
subtilis YdbS protein 
unknown 
unknown 
unknown, similar to 
oxidoreductases 
unknown, similar to B. 
subtilis NagB protein 
(glucosamine-6-phosphate 
isomerase) 
unknown, similar to 
unknown protein 
unknown, similar to PTS 
system, Lichenan-specific 
enzyme IIC component 
unknown, similar to PTS 
system, beta-glucoside 
enzyme IIB component 
Unknown, similar to PTS 
system enzyme IIA 

component 
Unknown; Similar to 
transcriptional regulator 
(antiterminator) 
Unknown; similar to 
antibiotic resistance protein 

Unknown, similar to B. 
subtilis YtcD protein 
unknown 
unknown 
unknown 
unknown 
Unknown, similar to ATP- 
dependent RNA helicase 

Unknown, similar to 
phosphomannomutase 
unknown 
unknown 
unknown, similar to oligo- 
1,6-glucosidase 
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SeqID n° 312 LM-1 331.1 From 899454 to 900287 

SeqID n° 313 LM-133Z1 From 898589 to 899467 

SeqID n° 314 LM-1 333.3 From 897273 to 898592 

SeqID n° 315 LM-1335.3 From 1104048 to 1104851 

SeqID n° 316 LM-1337.2 From 1102858 to 1103757 

SeqID n° 317 LM-1338.1 From 1099266 to 1102706 

SeqID n° 318 LM-1 339.1 From 1097786 to 1098988 

SeqID n° 319 LM-1341.1 From 1097322 to 1097603 

SeqID n° 320 LM-1342.1 From 1096874 to 1097116 

SeqID n° 321 LM-1343.2 From 1095975 to 1096835 

SeqID n° 322 LM-1 349. 1 From 1 093965 to 1 095803 

SeqID n° 323 LM-1351.1 From 1093000 to 1093773 

SeqID n° 324 LM-1352.1 From 1092255 to 1092869 

SeqID n° 325 LM-1353.1 From 1091110 to 1092054 

SeqID n° 326 LM-1354.1 From 1090375 to 1091043 

SeqID n° 327 LM-1357.1 From 1088941 to 1090362 

SeqID n° 328 LM-1358.1 From 1087409 to 1088854 

SeqID n° 329 LM-1 359. 1 From 1 086772 to 1 087434 

SeqID n° 330 LM-1360.1 From 1086100 to 1086630 



unknown, similar to sugar 
ABC transporter, permease 

protein - 
unknown, similar to sugar 
ABC transporter, permease 
protein 
Unknown, similar to 
putative sugar ABC 
transporter, periplasmic 
sugar-binding protein, 
unknown, highly similar to 
teichoic acid translocation 
permease protein TagG 
unknown, similar to metal 

binding protein 
unknown, highly similar to 

pyruvate carboxylase 
unknown, similar to cell- 
division protein RodA and 
FtsW 

unknown, similar to B. 
subtilis YlaN protein 
unknown, similar to B. 
subtilis Ylal protein 
unknown 
unknown, similar to GTP- 
binding ptotein TypA/BipA 
(tyrosine phosphorylated 
protein A) from E. coli and 
B. subtilis (YlaG) 
unknown, similar to 
extragenic suppressor 
protein SuhB and to myo- 
inositol-! (or 4)- 
monophosphatase 
unknown, similar to B. 
subtilis YktB protein 
unknown, similar to 
membrane and transport 

proteins 
unknown, similar to ABC 
transporter (permease) 
unknown, conserved 
hypothetical protein 
unknown, similar to sensor 
protein histidine kinases (2 
components regulatory 
systems) 

unknown, similar to 
transcription response 
regulator 
unknown 
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SeqID n° 331 

SeqlD n° 332 

SeqID n° 333 
SeqID n° 334 

SeqID n° 335 

SeqID n° 336 
SeqID n° 337 
SeqID n° 338 

SeqID n° 339 
SeqID n° 340 

SeqID n° 341 

SeqID n° 342 

SeqID n° 343 

SeqID n° 344 

SeqID n° 345 

SeqID n° 346 

SeqID n° 347 



LM-1361.1 

LM-1 363.1 

LM-1364.1 
LM-1367.1 

LM-1368.1 

LM-1 369.1 
LM-1 370.1 
LM-1 371.1 

LM-1 372.1 
LM-1 373.1 

LM-1 374.1 

LM-1 376.1 

LM-1 377.1 

LM-1 379.1 

LM-1380.1 

LM-1 381.1 

LM-1 383.1 



62 

From 1085806 

From 1084853 

From 1084413 
From 1082822 

From 1081183 

From 1080095 
From 1078977 
From 1077592 

From 1076989 
From 1075860 

From 1075360 

From 1074324 

From 1073813 

From 1073552 

From 1073146 

From 1072664 

From 1071462 



to 1086078 

to 1085806 

to 1084724 
to 1084225 

to 1082817 

to 1081072 
to 1080092 
to 1078143 

to 1077543 
to 1076858 

to 1075848 

to 1075325 

to 1074295 

to 1073800 

to 1073568 

to 1073149 

to 1072685 



unknown, similar to B. 
subtilis YktA protein 
unknown, similar to L- 
lactate dehydrogenase 
unknown 
unknown, highly similar to 

dihydrolipoamide 
dehydrogenase, E3 subunit 
of pyruvate dehydrogenase 

complex 
unknown, highly similar to 
pyruvate dehydrogenase 
(dihydrolipoamide 
acetyltransferase £2 
subunit) 
unknown, highly similar to 
pyruvate dehydrogenase 

(E1 beta subunit) 
unknown, highly similar to 
pyruvate dehydrogenase 
(E1 alpha subunit) 
unknown, similar to 
formylmethionine 
deformylase and to B. 
subtilis YkrB protein 
unknown, similar to B. 
subtilis YdfE protein 
Unknown, similar to 
molybdopterin biosynthesis 
protein MoeB 
unknown, similar to 
molybdenum cofactor 
biosynthesis protein B 
unknown, similar to 
molybdenum cofactor 
biosynthesis protein A 
unknown, similar to 
molybdenum cofactor 
biosynthesis protein C 
unknown, similar to 
molybdopterin converting 
factor (subunit 1). 
Unknown, similar to 
molybdopterin converting 
factor, subunit 2 
unknown, similar to 
molybdopterin-guanine 
dinucleotide biosynthesis 
MobB 
unknown, similar to 
molybdopterin biosynthesis 
protein moeA 
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SeqID n° 348 LM-1 385.1 From 1070597 to 1071367 

SeqlDn <> 349 LM-1 386.1 From 1069821 to 1070492 

SeqID n° 350 LM-1 387.1 From 1069156 to 1069818 

SeqID n° 351 LM-1388.1 From 1068594 to 1069175 

SeqlDn°352 LM-1389.1 From 1067723 to 1068526 



SeqlDn°353 LM-1 390.1 From 1066397 
SeqlDn°354 LM-1 391.1 From 1064521 



SeqID n° 355 
SeqID n° 356 
SeqID n° 357 
SeqID n° 358 



LM-1392.1 From 1063061 

LM-1394.1 From 1062108 

LM-1396.1 From 1061291 

LM-1398.3 From 1059872 



to 1067662 
to 1066377 



to 1064524 
to 1063064 
to 1062115 
to 1061275 



SeqID n° 359 
SeqID n° 360 


LM-1401.1 
LM-1 402.1 


From 235524 
From 234414 


to 237020 
to 235409 


SeqID n° 361 


LM-1 404.1 


From 233858 


to 234337 


SeqID n° 362 


LM-1 405.1 


From 233491 


to 233865 


SeqID n° 363 


LM-1 407.1 


From 232662 


to 233477 


SeqID n° 364 


LM-1408.1 


From 230840 


to 231766 


SeqID n° 365 


LM-1410.1 


From 229840 


to 230724 


SeqID n° 366 


LM-1 41 1.1 


From 229045 


to 229824 


SeqID n° 367 


LM-1 41 3.1 


From 226854 


to 228929 


SeqID n° 368 


LM-1 41 5.1 


From 224562 


to 226508 



unknown, similar to 
molybdate binding protein 
Unknown, similar to 
molybdenum transport 

protein 
unknown, similar to ABC 

transporter 
unknown, weakly similar to 
molybdopterin-guanine 
dinucleotide biosynthesis 

protein A 
unknown, highly similar to 
B. subtilis YoaT protein 
unknown 
unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIABC 
unknown, similar to 

glycerol kinase 
unknown, similar to 

transketolase 
unknown, similar to 

transketolase 
unknown, similar to 
hypothetical proteins 
lysyl-tRNA synthetase 
Unknwon, conserved 
hypothetical protein 
unknown, similar to 7,8- 
dihydro-6- 
hydroxymethylpterin 
pyrophosphokinase 
unknown, highly similar to 
dihydroneopterin aldolase 
unknown, highly similar to 
dihydropteroate synthases 
unknown, highly similar to 
cysteine synthase 
Unknwon, conserved 
hypothetical protein 
Unknwon, conserved 
hypothetical protein 
unknown, highly similar to 
cell division protein ftsH 
Unknown, fusion protein, 
N-terminal part similar to B. 
subtilis YacA protein, O 
terminal part similar to 
hypoxanthine-guanine 
phosphoribosyltransferase 
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SeqlDn 8 369 LM-1418.1 From 224027 to 224455 

SeqID n° 370 LM-1420.1 From 223490 to 223876 

SeqlDn°371 LM-1426.1 From 221370 to 222959 

SeqlDn°372 LM-1429.1 From 217800 to 221339 

SeqlDn°373 LM-1430.1 From 217125 to 217685 

SeqlDn°374 LM-1432.1 From 216434 to 217018 

SeqID n° 375 LM-1433.2 From 215721 to 216344 

SeqlDn 6 376 LM-1434.2 From 214486 to 215427 



SeqID n° 377 
SeqID n° 378 



LM-1435.1 
LM-1436.1 



From 213735 
From 213336 



SeqID n° 380 
SeqID n° 381 
SeqID n° 382 



LM-1438.1 
LM-1439.1 
LM-1442.1 



From 212366 
From 211425 
From 209470 



SeqID n° 384 
SeqID n° 385 

SeqID n° 386 

SeqID n° 387 

SeqID n° 388 



LM-1 445.1 
LM-1 446.1 

LM-1447.1 

LM-1 448.1 

LM-1449.3 



From 205819 
From 204624 

From 203640 

From 202641 

From 201217 



to 214409 
to 213668 



SeqID n° 379 LM-1437.1 From 212824 to 213285 



to 212689 
to 212294 
to 21 1389 



SeqID n° 383 LM-1444.1 From 207739 to 209271 



SeqID n° 389 LM-1451.1 From 1891236 

SeqID n° 390 LM-1453.1 From 1891944 

SeqID n 8 391 LM-1454.1 From 1892603 

SeqID n° 392 LM-1 457.1 From 1893497 



to 207408 
to 205577 

to 204353 

to 203597 

to 202590 

to 1891880 
to 1892600 

to 1893478 
to 1895464 



SeqID n° 393 LM-1458.1 From 1895461 to 1896219 



Unknown, 
polyribonucleotide 
nucleotidyltransferase 

domain present 
unknown, similar to B. 
subtilis DivIC protein 
Unknown, conserved 
membrane-spanning 

protein 
transcription-repair 

coupling factor 
Unknwon, similar to 
peptidyl-tRNA hydrolase 

Unknown 
Unknown, similar to B. 
subtilis general stress 

protein 
Unknwon, similar to L- 
lactate dehydrogenase 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, hypothetical 
lipoprotein 
Unknwon 
phospholipase C 
actin-assembly inducing 

protein precursor 
Zinc metalloproteinase 

precursor 
listeriolysin O precursor 

phosphatidylinositol- 
specific phospholipase c 
listeriolysin positive 
regulatory protein 
phosphoribosyl 
pyrophosphate synthetase 
unknown, highly similar to 
UDP-N-acetylglucosamine 

pyrophosphorylase 
Unknown, weakly similar to 
thiamin pyrophosphokinase 
Unknown, similar to 
ribulose-5-phosphate 3- 
epimerase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
putative serine/threonine- 
specific protein kinase 
Unknown, similar to 
putative phosphoprotein 
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SeqID n° 394 
SeqID n° 395 

SeqID n° 396 

SeqID n° 397 

SeqID n° 398 
SeqID n° 399 

SeqID n° 400 
SeqID n° 401 

SeqID n° 402 

SeqID n° 403 

SeqID n° 404 

SeqID n° 405 

SeqID n° 406 

SeqID n° 407 

SeqID n° 408 
SeqID n° 409 

SeqID n°410 



LM-1459.1 
LM-1461.1 

LM-1 463.1 

LM-1 465.1 

LM-1 466.2 
LM-1 467.2 

LM-1468.1 
LM-1469.1 

LM-1470.1 

LM-1471.1 

LM-1472.1 

LM-1473.1 



From 1896242 
From 1897577 

From 1898529 

From 1900927 

From 1902484 
From 1903120 

From 1904152 
From 1905952 

From 1906592 

From 1907218 

From 1907916 

From 1908827 



to 1897576 
to 1898515 

to 1900922 

to 1902126 

to 1903101 
to 1903995 

to 1905864 
to 1906551 

to 1907221 

to 1907919 

to 1908830 

to 1909591 

to 1912826 

to 1913910 

to 1915187 
to 1916086 

to 1917452 



LM-1474.1 From 1909614 

LM-1476.1 From 1912819 

LM-1477.1 From 1913907 

LM-1478.1 From 1915175 

LM-1480.1 From 1916166 



phosphatase 

Unknown, similar to RNA- 
binding Sun protein 
Unknown, similar to 
methionyl-tRNA 
formyltransferase 
unknown, similar to 
primosomal replication 
factorY 
unknown, similar to 
pantothenate metabolism 
flavoprotein homolog 
unknown, similar to 
guanylate kinases 
unknown, simlar to 
conserved hypothetical 
protein 
unknown, similar to 
fibronectin binding proteins 
unknown, similar to 
conserved hypotheticl 
proteins 
unknown, highly similar to 

orotate 
phosphoribosyltransferases 
unknown, highly similar to 
orotidine 5 -phosphate 
decarboxylases 
unknown, highly similar to 
dihydroorotase 
dehydrogenase 
unknown, highly similar to 

dihydroorotate 
dehydrogenase (electron 

transfer subunit) 
unknown, highly similar to 
carbamoyl-phosphate 
synthetase (catalytic 
subunit) 
unknown, highly similar to 
carbamoyl-phosphate 
synthetase (glutaminase 
subunit) 

unknown, highly similar to 

dihydroorotase 
unknown, highly similar to 
aspartate 
carbamoyltransferase 
unknown, highly similar to 
uracil permease 
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SeqlDn°411 LM-1481.2 From 1917581 to 1918132 



SeqID n° 412 LM-1482.2 From 1918363 to 1918593 
SeqID n° 413 LM-1483.2 From 645397 to 645858 



SeqID n° 414 LM-1484.1 From 645878 to 647590 

SeqID n° 415 LM-1486.1 From 647587 to 649404 

SeqID n° 416 LM-1487.1 From 649520 to 649819 

SeqID n° 417 LM-1491.1 From 651794 to 652420 

SeqID n° 418 LM-1492.1 From 652589 to 653044 

SeqID n° 419 LM-1493.1 From 653041 to 653982 

SeqID n° 420 LM-1494.1 From 654078 to 65461 1 

SeqID n° 421 LM-1495.1 From 654608 to 654850 

SeqID n° 422 LM-1498.1 From 654957 to 656708 



SeqID n° 423 LM-1 499. 1 From 656749 to 657243 
SeqID n° 424 LM-15.1 From 2720497 to 2721489 

SeqID n° 425 LM-1500.1 From 657323 to 658465 



SeqID n° 426 LM-1501.1 From 658572 
SeqID n° 427 LM-1502.1 From 659083 
SeqID n° 428 LM-1 504.1 From 659572 



SeqID n° 429 
SeqID n° 430 
SeqID n°431 

SeqID n° 432 
SeqID n° 433 

SeqID n° 434 
SeqID n° 435 



LM-1 506.1 From 660398 

LM-1507.1 From 660692 

LM-1 508.1 From 660996 

LM-1 51 0.3 From 661470 

LM-1 51 1.3 From 662257 

LM-1 516.1 From 669423 

LM-1518.1 From 670309 



to 659027 
to 659475 
to 660312 

to 660676 
to 660958 
to 661439 

to 662171 
to 663936 

to 669950 
to 672339 



unknown, highly similar to 
pyrimidine operon 
regulatory protein 

unknown 
Unknown, similar to 
transcription regulator 
MarR family 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
Unknown, similar to E. coli 
phage shock protein E 
Unknown, similar to acyl- 

carrier protein 
phosphodiesterase and 
NAD(P)H dehydrogenase 
Unknown, similar to 
transcription regulator 

MarR family 
Unknown, similar to 

oxidoreductase 
unknown, conserved 
hypothetical protein 

Unknown 
Unknown, C-terminal 
domain similar to 
glycerophosphoryl diester 
phosphodiesterase 

Unknown 
Unknown, similar to 
Phosphodiesterase 
Unknown, similar to protein 
kinase 
Unknown 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, hypothetical 
Unknown 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
transcription antiterminator 
BgIG family 



WO 01/77335 



PCT/FR01/01118 



67 



SeqlDn°436 LM-1519.1 From 672341 to 672793 



SeqID n° 437 LM-1520.1 From 672794 to 673855 



SeqID n° 438 LM-1521 .1 From 673870 to 674178 



SeqID n° 439 LM-1 523. 1 From 674205 to 675473 



SeqID n° 440 LM-1524.1 From 675563 to 676267 

SeqID n° 441 LM-1 525.1 From 676372 to 676788 

SeqID n° 442 LM-1527.1 From 676802 to 677395 

SeqID n° 443 LM-1 528.1 From 678494 to 679123 

SeqID n° 444 LM-1 529.1 From 679763 to 680296 



SeqID n° 445 LM-153.1 From 1216820 to 1218178 

SeqID n° 446 LM-1530.1 From 680360 to 681277 

SeqID n° 447 LM-1532.1 From 681543 to 683423 

SeqID n° 448 LM-1533.1 From 683519 to 684247 

SeqID n° 449 LM-1 535. 1 From 684390 to 685040 

SeqID n° 450 LM-1 536.3 From 685083 to 686903 

SeqID n° 451 LM-1 537.2 From 4681 69 to 469437 



SeqID n° 452 LM-1538.1 From 467519 to 468136 

SeqID n° 453 LM-154.1 From 1215993 to 1216487 

SeqID n° 454 LM-1 540.1 From 466517 to 467362 

SeqID n° 455 LM-1541.1 From 465934 to 466419 

SeqID n° 456 LM-1 545.1 From 459681 to 465722 

i 

SeqID n° 457 LM-1 547.1 From 457021 to 458913 

SeqID n° 458 LM-1 549.1 From 454534 to 456936 

SeqID n° 459 LM-1 55.1 From 1215128 to 1215679 



Unknown, similar to PTS 
system, fructose-specific 

IIA component 
Unknown, similar to PTS 
system, fructose-specific 

IIC component 
Unknown, similar to PTS 
system, fructose-specific 

IIB component 
Unknown, similar to an E. 
coli putative tagatose 6- 
phosphate kinase 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
methyltransferase 
Unknown 
Unknown, similar to a 
transcription regulator 
(surface protein PAg 
negative regulator par) 
Unknown, similar to 
cobyrinic acid a.c-diamide 
synthase 
Unknown, similar to 
oxidoreductase 
Unknown, similar to heavy 
metal-transporting ATPase 
Unknown 
Unknown, similar to 
putative transaldolase 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, weakly similar to 
a module of peptide 
synthetase 
unknown 
unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown proteins 
Unknown, putative 
peptidoglycan linked 
protein (LPXTG motif) 
Internalin B 
Intemalin A 
Unknown, similar to 
transcriptional regulator 
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SeqID n° 460 LM-1551 .1 From 453107 to 453853 

SeqID n° 461 LM-1553.1 From 452524 to 453093 

SeqID n° 462 LM-1554.1 From 451531 to 452406 

SeqID n° 463 LM-1557.1 From 448926 to 451508 

SeqID n° 464 LM-1 558.1 From 447804 to 448910 



SeqID n° 465 LM-156.1 From 1213636 to 1215087 
SeqID n° 466 LM-1 560. 1 From 447471 to 447791 



SeqID n° 467 LM-1561.1 From 447010 to 447474 



SeqID n° 468 LM-1564.1 From 445049 to 447010 



SeqID n° 469 LM-1 566.1 From 444031 to 444888 



SeqID n° 470 LM-1 568.1 From 443303 to 443851 



SeqID n° 471 LM-1569.1 From 442869 
SeqID n° 472 LM-157.1 From 1212983 
SeqID n° 473 LM-1570.1 From 441622 



SeqID n° 474 

SeqID n° 475 

SeqID n° 476 
SeqID n° 477 
SeqID n° 478 

SeqID n° 479 

SeqID n° 480 

SeqID n° 481 
SeqID n° 482 



LM-1 571.1 

LM-1572.1 

LM-1573.1 
LM-1 574.1 
LM-1 575.2 

LM-1 577.4 

LM-1 578.2 

LM-1 579.1 
LM-1 58.1 



From 440758 

From 440056 

From 439677 
From 439214 
From 437482 

From 436379 

From 977051 

From 976065 
From 1211869 



to 443300 
to 1213426 
to 442872 

to 441570 

to 440610 

to 439976 
to 439624 
to 438882 

to 437188 

to 978856 

to 977039 
to 1212990 



Unknown, similar to 

oxidoreductase 
Unknown, similar to 
acetyltransferase 
Unknown, similar to 
transcriptional regulator 

(LysR family) 
Unknown, similar to sugar 

hydrolase 
Unknown, similar to PTS 
fructose-specific enzyme 
IIC component 
unknown 
Unknown, similar to PTS 
fructose-specific enzyme 

I IB component 
Unknown, similar to PTS 
fructose-specific enzyme 
HA component 
Unknown, similar to 
transcription antiterminator 
BgIG family 
Unknown, similar to 
Staphylococcus xylosus 
glucose uptake protein 
Unknown, similar to RNA 
polymerase ECF-type 

sigma factor 
Unknown, similar to 
unknown protein 
similar to ethanolamine 
utilization protein EutQ 
Unknown, similar to rod 
shape-determining protein 
RodA 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown 
Unknown, similar to endo- 
1,4-beta-xylanase 
Unknown, conserved 
membrane protein 
unknown, similar to heat 
shock protein HtpG 

Unknown 
Unknown, similar to 
ethanolamine utilization 
protein EutH - Escherichia 
coli 
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SeqID n° 483 
SeqID n° 484 
SeqID n°485 


LM-1 580.1 
LM-1 581.1 
LM-1 582.1 


From 975071 
From 974229 
From 973758 


to 976033 
to 974975 
to 974216 


SeqID n° 486 


LM-1583.1 


From 972719 


to 973459 


SeqID n° 487 


LM-1 584.1 


From 972197 


to 972706 


Spain n° 48ft 

OCI^IU II too 


1 M-1 *ift»; 1 

l_IV|- 1 vJOO. 1 


Prnm Q71fW7 

rrurn 57 § \uof 




SeqID n° 489 


LM-1 586.1 


From 969549 


to 970496 


SeqID n° 490 


LM-1 587.1 


From 968819 


to 969424 


Sea ID n° 491 

VwVjlU I I ~ W 1 


LM-1 589 1 


From Q677A4 

11 will wwf f fj*t 


tn QRR77Q 

IU BOO 1 I %J 


SeqID n° 492 


LM-159.1 


From 1211308 


to 1211853 


SeqID n° 493 


LM-1590.1 


From 967029 


to 967760 


SeqID n° 494 
SeqID n° 495 


LM-1 592.1 
LM-1 593.1 


From 966245 
From 965513 


to 966913 
to 966136 


SeqID n° 496 


LM-1 594.1 


From 963294 


to 965255 


SeqID n° 497 


LM-1 596.1 


From 962607 


to 963263 


SeqID n° 498 


LM-1597.1 


From 961469 


to 962584 


SeqID n 8 499 


LM-16.1 


From 2721559 


to 2722857 


SealD n° 500 

UWUII^ || WWW 


LM-1 60 1 

1—1 V 1 I WW. 1 


From 191 1tMfi 
nun i i c- 1 i y*tu 


to 191 1^15 

IU I IO ID 


SeqID n° 501 


LM-1600.1 


From 960748 


to 961221 


SeqID n° 502 


LM-1602.1 


From 959711 


to 960631 


SeqID n° 503 


LM-1603.1 


From 958783 


to 959622 


SeqID n° 504 


LM-1605.1 


From 957724 


to 958764 



Unknown 
Unknown 
unknown, similar to protein- 
tyrosine-phosphatase 
unknown, similar to 
Nitroflavin-reductase 
Unknown, similar to B. 
subtilis CspR protein, rRNA 
methylase homolog 
unknown, similar to B. 
subtilis YhbA protein 
unknown, similar to B. 
subtilis YkcC protein 
unknown, conserved 
hypothetical protein 
unknown, similar to lipoate 
protein ligase A 
Unknown, similar to 
Salmonella enterica PduT 
protein 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis Yhfl protein 

unknown, similar to sortase 
unknown, similar to 3- 
methyladenine DNA 

glycosylase 
unknown, hypothetical 
transmembrane protein 

unknown, similar to 
transcription regulator 
(TetR/AcrR family) 
unknown, putative 
membrane protein 
Unknown, similar to 
membrane proteins 
Unknown, similar to carbon 
dioxide concentrating 
mechanism protein 
Unknown, similar to ABC 
transporter, ATP-binding 
protein (N-terminal part) 
unknown, similar to 
pantothenate kinase 
unknown, similar to B. 
subtilis YcgQ protein 
unknown, similar to B. 
subtilis YcgR protein 
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SeqID n° 505 LM-1606.1 From 956031 to 957602 

SeqID n° 506 LM-1607.1 From 953842 to 955740 

SeqID n° 507 LM-1608.1 From 952318 to 953769 

SeqID n° 508 LM-1 609.1 From 951956 to 952306 



SeqID n° 509 LM-161.1 From 1210459 to 1211031 
SeqID n° 510 LM-1610.1 From 950673 to 951941 



SeqID n° 51 1 LM-161 1.1 From 950363 to 950668 

SeqID n° 512 LM-1613.1 From 948754 to 950220 

SeqID n° 513 LM-1614.1 From 947718 to 948530 

SeqID n° 514 LM-161 5.1 From 947101 to 947604 

SeqID n° 515 LM-1617.1 From 945915 to 947018 

SeqID n° 516 LM-1618.1 From 945520 to 945912 

SeqID n° 517 LM-1619.1 From 944243 to 945379 

SeqID n° 518 LM-162.1 From 1209821 to 1210462 



SeqID n° 519 LM-1620.1 From 943526 
SeqID n° 520 LM-162Z1 From 942148 



SeqID n° 521 
SeqID n° 522 
SeqID n° 523 



LM-1623.1 From 941615 
LM-1624.2 From 940810 
LM-1625.2 From 469575 



SeqID n° 524 LM-1626.1 From 470122 



to 944200 

to 943497 

to 942112 
to 941484 
to 470078 
to 472158 



SeqID n° 525 LM-1627.1 From 472330 to 472644 
SeqID n° 526 LM-1629.1 From 472781 to473710 



SeqID n° 527 LIVM63.1 From 1209053 to 1209808 



unknown, similar to ABC 
transporter ATP-binding 
protein (antibiotic 

resistance) 
Unknown, similar to 
transcription antiterminator 

BgIG family 
Unknown, similar to beta- 
glucosidase 
Unknown, similar to 
phosphotransferase 
system enzyme HA 

unknown 
Unknown, similar to 
phosphotransferase 
system enzyme IIC 
Unknown, similar to PTS 
system, MB component 

Unknown, similar to 
succinate semialdehyde 
dehydrogenase 
unknown, similar to 
transporters (formate) 
Unknown 
Unknown 
Unknown, similar to 
transcription regulator, 
GntR family 
unknown, similar to 
membrane proteins . 
Unknown, similar to 
Salmonella enterica PduL 
protein 
unknown, similar to 
phosphoglycerate mutase 
Unknown, similar to 
glutathione Reductase 
unknown 
Unknown 
Unknown 
Unknown, similar to 
penicillin-binding protein 
(D-alanyl-D-alanine 
carboxypeptidase) 
Unknown 
Unknown, similar to B. 
subtilis transcription 

regulator LytR 
unknown, similar to 
cobalamin adenosyl 
transferase 
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SeqID n° 528 


LM-1631.1 


From 473936 


to 476716 


SeqID n° 529 


LM-1632.1 


From 476960 


to 478447 


oeqiu n ooU 


LM- 1634.1 


From 478721 


to 479710 


SeqID n° 531 


LM-1635.1 


From 479765 


to 481 153 


q pn |n n Q 


LIVI- IOOO. I 


rrOm HO\/Jyj 


to 482701 


OCLj 1 L^ || jOO 

SeqID n° 534 


LI VI- I Oj f . I 

LM-1638.1 


rrom 400 loo 
From 483934 


to 483891 
to 484599 


SeqID n° 535 
SeqID n° 536 


LM-1 639.1 
LM-1 64.1 


From 484620 
From 1208603 


to 485375 
to 1208887 


oeqiu n 00/ 


1 tiA A.G.AA A 

LIV1-1044. 1 


From 485493 


to 487652 


Q 0 «m n° COO 

oeqiu n 000 


LA/l-1o4o.1 


From 487649 


to 488797 


oeqiu n ooy 


1 KA AC2ACi A 

LM-1040.1 


Prom 488802 


to 489749 


oeqiu ii o*fu 


UIVI- I OH f . X 


rrom 4oyyuo 


to 491 500 


SeqID n° 541 


LM-1 650.1 


From 491634 


to 492917 


SeqID n° 542 


LM-1652.1 


From 492918 


to 494018 


SeqID n° 543 


LM-1 653.1 


From 494011 


to 495561 


SeqID n° 544 


LM-1 655.1 


From 496381 


to 497919 


SeqID n° 545 


LM-1656.1 


From 498171 


to 500240 


SeqID n 6 546 
SeqID n° 547 
SeqID n° 548 


LM-1 658.1 
LM-1659.3 
LM-1 66.1 


From 500306 
From 500799 
From 1207111 


to 500779 
to 501284 
to 1208571 


SeqID n° 549 


LM-1661.4 


From 2632907 


to 2633761 



Unknown, conserved 
hypothetical protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
penicillin acylase and to 
conjugated bile acid 

hydrolase 
Unknown, similar to 
glutamate decarboxylase 
Unknown, similar to amino 
acid antiporter 
Unknown 
Unknown, similar to 
unknown proteins 

Unknown 
unknown, similar to 
putative carboxysome 
structural protein 
Unknown, similar to 
unknown proteins 
Unknown, conserved 
hypothetical proteins 
Unknown, conserved 
hypothetical protein similar 
to B. subtilis YeaC 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

permeases 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

hydantoinase 
Unknown, similar to 
transcription regulator (VirR 
from Streptococcus 

pyogenes) 
Unknown, putative 
membrane associated 
lipoprotein 
Unknown 
Unknown 
unknown, similar to 
acetaldehyde 
dehydrogenase / alcohol 
dehydrogenase 
Unknown, similar to 
fructose-1 ,6-bisphosphate 
aldolase 
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SeqlDn°550 LM-1662.2 From 2631303 to 2632586 



SeqlDn°551 LM-1663.1 

SeqID n° 552 LM-1664.1 

SeqID n° 553 LM-1665.1 

SeqID n° 554 LM-1666.2 

SeqID n° 555 LM-1668.2 

SeqID n° 556 LM-1669.1 

SeqID n° 557 LM-167.1 

SeqID n° 558 LM-1671.1 

SeqID n° 559 LM-1674.1 

SeqID n° 560 LM-1676.1 

SeqID n° 561 LM-1678.1 

SeqID n° 562 LM-1679.1 

SeqID ri° 563 LM-168.1 

SeqID n° 564 LM-1681.1 

SeqID n° 565 LM-1682.1 

SeqID n° 566 LM-1683.1 

SeqID n° 567 LM-1684.1 

SeqID n° 568 LM-1685.1 



From 2630285 to 2631310 

From 2629208 to 2630278 

From 2627816 to 2629087 

From 2626442 to 2627713 

From 2625414 to 2626361 

From 2624980 to 2625417 

From 1206590 to 1207111 

From 2623125 to 2624411 

From 2622067 to 2623122 

From 2621201 to 2622067 

From 2620514 to 2621089 

From 2619415 to 2620491 

From 1205922 to 1206575 

From 2618577 to 2619428 

From 2617239 to 2618276 

From 2616835 to 2617242 

From 261 5458 to 2616699 

From 2614692 to 2615321 



Unknwon, weakly similar to 
human N- 
acetylglucosaminyl- 
phosphatidylinositol 
biosynthetic protein 
Unknown, similar to 
galactosyltransferase 
Unknown, conserved 
hypothetical protein 
unknown, highly similar to 
UDP-N-acetylglucosamine 
1-carboxyvinyltransferase 
unknown, highly similar to 
transcription terminator 
factor rho 
Unknwon, similar to 
glycosyl transferases 
wall teichoic acid 
glycosylation protein GtcA 
Unknown, similar to 
putative carboxysome 
structural protein 
unknown, highly similar to 
homoserine 
dehydrogenase 
unknown, highly similar to 

threonine synthase 
unknown, highly similar to 
homoserine kinase 
Unknwon, similar to 
thymidine kinase 
unknown, highly similar to 
peptide chain release 

factor 1 
Unknown, similar to 
putative carboxysome 
structural protein (eutL) 
Unknown, similar to 
protoporphyrinogen 
oxidase 
Unknown, similar to yeast 
translation initiation protein 

Unknown, similar to 
phosphatases 
unknown, highly similar to 

glycine 
hydroxymethyltransferase 
unknown, highly similar to 
uracil 

phosphoribosyltransferase 
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SeqID n° 569 LM-1686.1 From 2613435 to 2614574 Unknwon, similar to UDP- 

N-acetyiglucosamine 2- 
epimerase 

SeqID n° 570 LM-1691.1 From 2612205 to 2612603 unknown, highly similar to 

ATP synthase subunit i 
SeqID n° 571 LM-1693.1 From 261 1482 to 2612198 unknown, highly similar to 

H+-transporting ATP 
synthase chain a 

SeqID n° 572 LM-1694.1 From 2610592 to 2611104 unknown, highly similar to 

H+-transporting ATP 
synthase chain b 

SeqID n° 573 LM-1695.1 From 2610056 to 2610595 unknown, highly similar to 

H+-transporting ATP 
synthase chain delta 

SeqID n° 574 LM-1697.1 From 2608515 to 2610029 unknown, highly similar to 

H+-transporting ATP 
synthase chain alpha 

SeqID n° 575 LM-1698.1 From 2607605 to 2608477 unknown, highly similar to 

H+-transporting ATP 
synthase chain gamma 
SeqID n° 576 LM-170.1 From 1205018 to 1205899 Unknown, similar to 

ethanolamine ammonia- 
lyase, light chain 

SeqID n° 577 LM-1700.1 From 2606123 to 2607544 unknown, highly similar to 

H+-transporting ATP 
synthase chain beta 

SeqID n° 578 LM-1701.1 From 2605697 to 2606101 unknown, highly similar to 

H+-transporting ATP 
synthase chain epsilon 

SeqID n° 579 LM-1702.1 From 2605337 to 2605573 Unknown, similar to B. 

subtilis YwzB protein 

SeqID n° 580 LM-1 704. 1 From 2603863 to 26051 55 unknown, UDP-N- 

acetylglucosamine 1- 
carboxyvinyltransferase 

SeqID n° 581 LM-1 705.2 From 2602705 to 2603700 unknown, similar to MreB- 

like protein 

SeqID n° 582 LM-1706.2 From 521863 to 522591 Unknown 
SeqID n° 583 LM-1 707.1 From 522628 to 523521 Unknown, similar to 

transcriptional regulator 
(LysR family) 

SeqID n° 584 LM-1708.1 From 523719 to 525713 Unknown, similar to 

NADH:flavin 
oxidoreductase 

SeqID n° 585 LM-171 .1 From 1203634 to 1204998 Unknown, similar to 

ethanolamine ammonia- 
lyase, heavy chain 

SeqID n° 586 LM-1710.1 From 525813 to 526688 Unknown, similar to 

shikimate 5- 
dehydrogenase 

SeqID n° 587 LM-1712.1 From 526754 to 527512 Unknown, similar to 3- 

dehydroquinate 
dehydratase 
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Caftin — o coo 

beqiu n boo 


uvi-1713.1 


From 527580 


to 528488 


SeqID n° 589 


LM-1714.1 


From 528563 


to 530323 


SeqID n° 590 


LM-1715.1 


From 530515 


to 531108 


SeqID n° 591 


LM-1716.1 


From 531294 


to 532253 


oeqiu n osz 


1 AJ1 A~7A~7 A\ 

LM-1 /I (A 


From 532328 


to 532552 


SeqID n° 593 


LM-1718.2 


From 532603 


to 5341 11 


SeqID n° 594 


LM-1719.1 


From 534307 


to 534756 


SeqID n° 595 


LM-1721.1 


From 534753 


to 535421 


SeqID n° 596 


LM-1722.1 


From 535428 


to 536078 


SeqID n° 597 


LM-1 723.1 


From 536187 


to 538247 


SeqID n° 598 


LM-1724.1 


From 538251 


to 538853 


SeqID n° 599 


LM-1725.1 


From 538881 


to 539348 


SeqID n° 600 
beqiu n oui 


LM-1726.1 
LM-1727.1 


From 539365 
From 539774 


to 539763 
to 540424 


SeqID n° 602 


LM-1728.1 


From 540421 


to 541467 


SeqID n° 603 


LM-1 729.1 


From 541483 


to 541776 


SeqID n° 604 


LM-1 73.1 


From 1202171 


to 1203592 


SeqID n° 605 


LM-1731.1 


From 541791 


to 543062 


SeqID n° 606 


LM-1733.1 


From 543202 


to 544137 


SeqID n° 607 
SeqID n°608 


LM-1734.1 
LM-1735.1 


From 544923 
From 545609 


to 545501 
to 546289 


SeqID n°609 


LM-1736.1 


From 546314 


to 546673 



Unknown, similar to 
transcriptional regulator 
(LysR family) 
Unknown, similar to 
acylase 
Unknown, weakly similar to 
esterase 
Unknown, similar to 
transmembrane protein 
Unknown, similar to B. 
subtilis YnzC protein 
Unknown, similar to sugar 

transferase 
Unknown, similar to ribose 
5-phosphate isomerase 

Unknown, similar to 
ribulose-5-phosphate 3 
epimerase 
Unknown, similar to 

transaldolase 
Unknown, similar to 
transcription antiterminator 
BgIG family 
Unknown, similar to 
putative sugar-phosphate 

isomerase 
Unknown, similar to PTS 
fructose-specific enzyme 
IIA component 
unknown 
Unknown, similar to 
ribulose-5-phosphate 3- 

epimerase 
Unknown, similar to polyol 
(sorbitol) dehydrogenase 
Unknown, similar to PTS 
system, Galactitol-specific 
II B component 
Unknown, similar to 
ethanolamine utilization 
protein EutA (putative 
chaperonin) 
Unknown, similar to PTS 
system, Galactitol-specific 
IIC component 
Unknown, similar to 
phosphoribosyl 
pyrophosphate synthetase 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown 
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SeqID n° 610 LM-1737.1 From 546680 to 547138 

SeqID n° 611 LM-174.1 From 1200625 to 1202082 

SeqID n° 612 LM-1740.1 From 549438 to 549869 

SeqID n° 613 LM-1741.1 From 549916 to 551346 

SeqID n° 614 LM-1742.2 From 551467 to 552282 

SeqID n" 615 LM-175.1 From 1200051 to 1200632 

SeqID n° 616 LM-1755.2 From 2678302 to 2679330 

SeqID n° 617 LM-1757.1 From 2681167 to 2682018 

SeqID n° 618 LM-1758.1 From 2682040 to 2682462 



SeqID n° 619 LM-1759.1 From 2682584 to 2682943 
SeqID n° 620 LM-176.1 From 1198637 to 1199776 



SeqID n° 621 


LM-1760.1 


From 


2683187 


to 2684056 


SeqID n° 622 
SeqID n° 623 
SeqID n° 624 


LM-1 761.1 
LM-1762.1 
LM-1763.1 


From 
From 
From 


2684251 
2684665 
2685297 


to 2684643 
to 2685102 
to 2686043 


SeqID n° 625 


LM-1 766.1 


From 


2686049 


to 2686846 


SeqID n° 626 


LM-1 769.1 


From 


2686849 


to 2687715 


SeqID n° 627 


LM-1771.1 


From 


2687691 


to 2688530 


SeqID n° 628 


LM-1 772.1 


From 


2688661 


to 2689323 


SeqID n° 629 
SeqID n° 630 
SeqID n° 631 
SeqID n° 632 
SeqID n° 633 


LM-1773.1 
LM-1774.1 
LM-1 775.2 
LM-1 776.3 
LM-1777.3 


From 
From 
From 
From 
From 


2689442 
2690371 
2691673 
1441313 
1441581 


to 2690332 
to 2691465 
to 2692080 
to 1441540 
to 1442114 


SeqID n° 634 


LM-1 779.1 


From 


1443873 


to 1445042 



Unknown, weakly similar to 
transcription regulator 
unknown, similar to 
sensory transduction 

histidine kinase 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
Bacillus anthracis 
encapsulation protein 

CapA 
Unknown, similar to 
phosphoglycerate mutase 
unknown, similar to similar 
to two-component 
response regulator 
Unknown, similar to ATP 
binding proteins 
Unknown, similar to 
oxidoreductase, aldo/keto 
reductase family 
Unknown, similar to 
transcription regulators 
(MerR family) 
Unknown 
unknown, similar to 
NADPH-dependent butanol 
dehydrogenase 
Unknown, similar to 
unknown proteins 
ribosomal protein S9 
ribosomal protein L13 
unknown, highly similar to 
pseudouridylate synthase I 
Unknown, highly similar to 
B. subtilis YbaF protein 
Uknown similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknown 
ribosomal protein L17 
Unknown 
Unknown, modulates DNA 

topology 
Unknown, similar to Acetyl- 
CoA:acetyItransferase 
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SeqID n° 635 LM-1780.2 From 1445192 to 1446358 Unknown, similar to 

hydroxy-3-methylglutaryl 
coenzyme A synthase 
SeqID n° 636 LM-1781.2 From 1446383 to 1446982 Unknown 
SeqID n° 637 LM-1782.1 From 1447052 to 1448296 Unknown, highly similar to 

B. subtilis YxiO protein 
SeqID n° 638 LM-1783.1 From 1448315 to 1449583 Unknown, weakly similar to 

pyrophosphatase 

SeqID n° 639 LM-1784.1 From 1449647 to 1450684 Unknown, conserved 

hypothetical protein 

SeqID n° 640 LM-1785.1 From 1450701 to 1451597 Unknown, weakly similar to 

UDP-N-acetylglucosaminyl- 
3-enolpyruvate reductase 

SeqID n° 641 LM-1787.1 From 1451813 to 1452799 Unknown, similar to glycine 

betaine/carnitine/choline 
ABC transporter 

SeqID n° 642 LM-1789.1 From 1452796 to 1454310 Unknown, similar to glycine 

betaine/carnitine/choline 
ABC transporter 
(membrane protein) 

SeqID n° 643 LM-179.1 From 1197172 to 1198047 unknown, similar to 

Salmonella enterica PduX 
protein 

SeqID n° 644 LM-1790.1 From 1454347 to 1455177 Unknown 
SeqID n° 645 LM-1791 .2 From 1455316 to 1456662 Unknown, similar to metal 

ion transport proteins 

SeqID n° 646 LM-1794.2 From 1456775 to 1457446 Unknown, similar to 

betaine/carnitine/choline 
ABC transporter 
(membrane p) 

SeqID n° 647 LM-1795.1 From 1457461 to 1458387 Unknown, similar to glycine 

betaine/carnitine/choline 

ABC transporter 
(osmoprotectant-binding 
protein) 

SeqID n° 648 LM-1 796. 1 From 1 458389 to 1 459045 Unknown, similar to glycine 

betaine/carnitine/choline 
ABC transporter 
(membrane protein) 

SeqID n° 649 LM-1798.1 From 1459049 to 1460242 Unknown, similar to glycine 

betaine/carnitine/choline 
ABC transporter (ATP- 
binding protein) 

SeqID n° 650 LM-1799.1 From 1460534 to 1461094 Unknown, similar to 

unknown proteins 

SeqID n° 651 LM-1 8.1 From 2722884 to 2723156 Unknwon, similar to 

hypothetical PTS enzyme 
IIB component 

SeqID n° 652 LM-1800.2 From 1461343 to 1461726 Unknown, similar to 

unknown proteins 
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SeqID n° 653 


LM-1 801 .2 


From 


1461866 


SeqID n° 654 
SeqID n° 655 


LM-1 802.1 
LM-1 803.1 


From 
From 


1463506 
1464209 


SeqID n" 656 


LM-1 805.1 


From 


1465638 


SeqID n° 657 


LM-1806.1 


From 


1467325 


SeqID n° 658 


LM-1 807.1 


From 


1468221 


SeqID n° 659 


LM-1809.1 


From 


1469444 


SeqID n 660 


i hA A OA A A 

LM-1810.1 


From 


1470685 


SeqID n° 661 
SeqID n° 662 


LM-1811.1 
LM-1 81 3.1 


From 
From 


1472980 
1473889 


SeqID n° 663 


LM-1814.1 


From 


1474802 


SeqID n° 664 


LM-1815.1 


From 


1475944 


beqlD n oob 
SeqID n° 666 


l K il A QAd A 

LM-lolb.1 
LM-1817.1 


From 
From 


A A 7CC A O 
14/004Z 

1477050 


SeqID n° 667 


LM-1818.1 


From 


1478071 


SeqID n° 668 


LM-182.1 


From 


1194890 


SeqID n° 669 
SeqID n 670 
SeqID n° 671 


LM-1 820.1 

1 R Jl A t±+SA A 

LM-1821.1 
LM-1823.1 


From 
From 
From 


1478477 
1479330 
1480246 


SeqID n° 672 


LM-1825.1 


From 


1481188 


SeqID n° 673 


LM-1 826.1 


From 


1482096 


SeqID n° 674 


LM-1 827.1 


From 


1483549 


SeqID n° 675 


LM-1 829.1 


From 


1484590 


SeqID n° 676 


LM-183.1 


From 


1194120 



to 


1463467 


Unknown, similar to ABC 






transporter (ATP-binding 






protein) 


to 


1464156 


Unknown 


to 


1465549 


Unknown, similar to 






glutathione reductase 


to 


1467305 


Unknown, similar to 






unknown proteins 


to 


1468206 


Unknown, similar to 






dihydrodipicolinate 






synthase 


to 


1469432 


Unknown, similar to 






aspartokinase I (alpha and 






beta subunits) 


to 


1470487 


Unknown, similar to 






aspartate-semialdehyde 






dehydrogenase 


to 


1472850 


unknown, similar to 






penicillin-binding protein 


to 


1473588 


superoxide dismutase 


to 


1474689 


unknown, similar to 






unknown proteins 


to 


1475908 


Unknown, similar to 






putative peptidoglycan 






acetylation protein 


to 


1476651 


Unknown, similar to 






transport proteins 


to 


1476968 


Unknown 


to 


1477934 


Unknown, similar to protein 






secretion PrsA (post- 






translocation molecular 






chaperone) 


to 


1478496 


transcriptional regulator 






ZurR (ferric uptake 






regulation) 


to 


1196083 


Unknown, similar to 






acetate kinase 


to 


1479355 


metal transport protein 


to 


1480103 


ABC transporter 


to 


1481172 


Unknown, conserved 






hypothetical protein 


to 


1482081 


unknown, similar to 






endonuclease IV 


to 


1483403 


Unknown, similar to ATP- 






dependent RNA helicase, 






DEAD-box family (deaD) 


to 


1484544 


Unknown, similar to E. coli 






LytB protein 


to 


1485711 


Unknown, conserved 






hypothetical protein 


to 


1194824 


Unknown, similar to 






glycerol uptake facilitator 
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SeqID n° 677 


LM-1830.1 


From 1485708 


to 1486412 


SeqID n° 678 


LM-1833.2 


From 1486480 


to 1487604 


SeqID n° 679 


LM-1 834.2 


From 289562 


to 290521 


SeqID n° 680 


LM-1 835.1 


From 289139 


to 289555 


SeqID n° 681 


LM-1 836.1 


From 287853 


to 288992 


SeqID n° 682 
SealD n° 683 


LM-1838.1 
LM-1 84.1 


From 286219 
From 1192974 

1 IwJII 1 1 V&.V ( *T 


to 287718 

to 11Q40C19 


SeqID n° 684 
SeqID n° 685 
SeqID n° 686 


LM-1 840.1 
LM-1 842.1 
LM-1843.1 


From 284365 
From 282755 
From 281021 


to 286011 
to 284227 
to 282481 


SealD n° 687 


I M-1R44 1 

L_l VI 1 Utt. | 




io zouyoo 


SealD n° 688 


LM-1 85 1 

l_J VI 1 ww . | 


From 11Q154Q 

n wll I 1 Is 1 w**27 


tn 1 1Q9Q*^ 


SeqID n° 689 


LM-1852.1 


From 276728 


to 280333 


SeqID n° 690 


LM-1 854.1 


From 273003 


to 276557 


SealD n" 691 

vJv^l L*J 1 1 WW 1 


I M-1 fl^R 9 


From 971 ^OA 
"I will £. f I 0£>H 


tn 979509 


SealD n°692 


I M-1 858 1 

t_IVI~ IwJO, 1 


Frnm 9RQ754 
r ( ui 1 1 ^w9 / o*t 


tn 9709*V7 


SealD n° 693 

WWX^IL/ II www 

SeqID n° 694 


LM-1 859 2 

L1VI 1 w w w.£_ 

LM-1 860.2 


From 969087 

PI wll I cwUU/ 

From 1993372 


to 9RQ79Q 

to 1994637 


SeqID n° 695 


LM-1861.1 


From 1991047 


to 1993326 


SealD n° 696 

VvV^IL^ 1 I www 


LM-1 862 1 

^1 VI 1 \J\J4mm 1 


From 1Q8Q805 

1 1 wl 1 1 IwOwwww 


to 1QQ0B19 


SealD n° 697 


LM-1 863 1 


From 1Q8815Q 


to 1QQQQ09 

IU I v70c70UZ 


SeqID n° 698 


LM-1 864.1 


From 1987365 


to 1988048 


SeqID n° 699 


LM-1865.1 


From 1986342 


to 1987346 


SeqID n° 700 


LM-1 866.1 


From 1985218 


to 1986345- 



protein 

Unknown, conserved 
hypothetical protein 
RNA polymerase sigma 

factor RpoD 
Unknown, similar to other 
proteins 
Unknown, similar to 
transcriptional regulators 

Unknown, similar to 
succinyldiaminopimelate 
desuccinylase 
intemalin E 
unknown, similar to 
NADPH-dependent butanol 
dehydrogenase 
intemalin H 
intemalin G 
Unknown, similar to 
phospho-beta-glucosidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ethanolamine utilization 

protein EutE 
RNA polymerase (beta 

subunit) 
RNA polymerase (beta 
subunit) 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
pyruvate formate-lyase 
Unknown, similar to 

peptidase 
Unknown, similar to 
malolactic enzyme (malate 
dehydrogenase) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
(hypothetical sensory 
transduction histidine 
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kinase) 



SeqlDn°701 LM-1 867.1 From 1 984056 to 1985192 

SeqlDn°702 LM-1869.1 From 1982630 to 1983736 

SeqlDn°703 LM-1871.1 From 1981774 to 1982640 

SeqlDn°704 LM-1872.1 From 1981299 to 1981634 

SeqIDn°705 LM-1874.1 From 1980511 to 1981302 

SeqID n° 706 LM-1 875. 1 From 1 980091 to 1 980495 

SeqID n° 707 LM-1876.1 From 1978895 to 1980076 

SeqID n° 708 LM-1877.1 From 1977928 to 1978905 

SeqID n° 709 LM-1879.1 From 1977307 to 1977780 

SeqID n° 710 LM-188.1 From 1190547 to 1191542 

SeqID n° 711 LM-1880.1 From 1976307 to 1977140 

SeqID n° 712 LM-1881.1 From 1975446 to 1976303 

SeqID n° 713 LM-1882.1 From 1975059 to 1975442 

SeqID n° 714 LM-1884.1 From 1972176 to 1974962 

SeqID n° 715 LM-1885.1 From 1971547 to 1972131 

SeqID n° 716 LM-1886.1 From 1970343 to 1971524 

SeqID n° 71 7 LM-1 888. 1 From 1 969037 to 1 970329 

SeqID n° 718 LM-189.1 From 1190269 to 1190532 



Unknown, similar to 
unknown proteins 
(hypothetical sensory 
transduction histidine 

kinase) 
Unknown, similar to 

oxidoreductases 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
dihydrodipicolinate 

reductase 
Unknown, similar to 
methylglyoxal synthase 
Unknown, similar to tRNA 
CCA-adding enzyme 
Unknown, similar to 
transcriptional regulator 
and biotin acetyl-CoA- 
carboxylase synthetase 
unknown; similar to 
thioredoxin 
Unknown, hyghly similar to 
Salmonella enterica PduO 
protein 
unknown, similar to 
ketopantoate 
hydroxymethyltransferases 

unknown, similar to 
panthotenate synthetases 
unknown, similar to 
aspartate 1- 
decarboxylases 
unknown, similar to ATP- 
dependent helicases 
unknown, similar to 
hypothetical proteins 
unknown, similar to 

aspartate 
aminotransferases 
unknown, similar to 
sparaginyl-tRNA 
synthetases 
Unknown, similar to carbon 
dioxide concentrating 
mechanism protein 
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SeqlDn°719 LM-1891.1 

SeqlDn°720 LM-1 893.1 

SeqlDn°721 LM-1894.1 

SeqlDn°722 LM-1 896.1 

SeqlDn°723 LM-1897.2 

SeqlDn°724 LM-1 898.2 

SeqlDn°725 LM-19.1 

SeqlDn°726 LM-190.1 

SeqlDn°727 LM-1 900.1 

SeqlDn°728 LM-1904.1 

SeqlDn°729 LM-1905.1 

SeqiDn°730 LM-1906.1 

SeqlDn°731 LM-1907.1 

SeqlDn°732 LM-1908.1 

SeqlDn°733 LM-1 91 0.1 

SeqlDn°734 LM-1 91 1.1 

SeqlDn°735 LM-1912.1 

SeqlDn°736 LM-1913.1 

SeqlDn°737 LM-1915.1 

SeqlDn°738 LM-1916.1 

SeqlDn°739 LM-1 91 8.2 

Seq!Dn°740 LM-1 91 9.2 



From 1 9681 69 to 1 968888 

From 1 967499 to 1 9681 58 

From 1 967015 to 1 967506 

From 1 964489 to 1 966972 

From 1 963852 to 1 964457 

From 2208576 to 221 0351 

From 27231 59 to 2723593 

From 1 189785 to 1 1 90264 

From 2207103 to 2208362 

From 2205709 to 2207016 

From 2204857 to 2205708 

From 2203996 to 2204829 

From 2201719 to 2203980 

From 2200723 to 2201 544 

From 21 99365 to 2200723 

From 2197763 to 21991 15 

From 21 97267 to 21 97728 

From 21 96382 to 21 97206 

From 2194399 to 2196339 

From 2193645 to 2194412 

From 2192693 to 2193448 

From 2191525 to 2192256 



unknown, similar to 
chromosome replication 
initiation protein 
unknown, probable 
endonuclease III (DNA 
repair) 
unknown 
unknown, similar to 
penicillin-binding protein 2A 

unknown, similar to DNA 
repair and homologous 
recombination protein 
Unknown, similar to 
maltogenic amylase 
Unknwon, similar to 
mannitol-specific PTS 
enzyme HA component 
Unknown 
Unknown, similar to 
maltose/maltodextrin- 

binding protein 
Unknown, similar to 
maltodextrin transport 
system permease 
Unknown, similar to 
maltodextrin transport 
system permease 

Unknown, similar to 
maltodextrose utilization 
protein MalA 
Unknown, similar to 
maltosephosphorylase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphoglucomutase 
Unknown, similar to 
unknown proteins 
unknown 
Unknown, similar to ABC 
transporter (permease) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
unknown proteins 
Unknown, similar to FMN- 
containing NADPH-linked 
nitro/flavin reductase 
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SeqID n° 741 LM-192.1 From 1 188949 to 1 189788 

SeqlDn°742 LM-1920.1 From 2190489 to 2191445 

SeqID n° 743 LM-1921.1 From 2189605 to 2190402 

SeqID n° 744 LM-1922.1 From 2188430 to 2189563 

SeqID n° 745 LM-1923.1 From 2187663 to 2188424 



SeqID n° 746 

SeqID n° 747 

SeqID n° 748 

SeqID n° 749 
SeqID n° 750 

SeqID n° 751 
SeqID n° 752 

SeqID n° 753 

SeqID n° 754 

SeqID n° 755 
SeqID n° 756 
SeqID n° 757 
SeqID n° 758 
SeqID n° 759 
SeqID n° 760 
SeqID n° 761 



LM-1924.1 From 2186557 to 2187411 

LM-1927.1 From 2184344 to 2186338 

LM-1930.1 From 2182823 to 2183800 

LM-1932,1 From 2182218 to 2182784 

LM-1933.2 From 2181329 to 2182216 

LM-1935.1 From 1610206 to 1611165 

LM-1936.1 From 1611449 to 1612405 

LM-1938.1 From 1612395 to 1613279 

LM-1942.1 From 1613497 to 1616823 

LM-1944.1 From 1616945 to 1617880 

LM-1945.1 From 1617901 to 1619214 

LM-1 946. 1 From 1 61 9240 to 1 61 9926 

LM-1947.1 From 1620091 to 1621188 

LM-1 948.1 From 1621230 to 1622342 

LM-1 949.1 From 1622583 to 1623047 

LM-195.1 From 1188293 to 1188928 



SeqID n° 762 LM-1950.1 From 1623099 to 1624292 



Unknown, similar to 
ethanolamine utilization 
protein EuU 
Unknown, similar to 
mannnose-6 phospate 
isomelase 
Unknown, similar to 
hydrolase 
Unknown, similar to N- 
acetylglucosamine-6- 
phosphate deacetylase 

Unknown, similar to 
transcriptional regulator 
(DeoR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to ferrous 
iron transport protein B 

Unknown, similar to 
phosphotransacetylase 

Unknown 
Unknown, similar to a 
protein required for 
pyridoxine synthesis 
unknown, highly similar to 

6-phosphofructokinase 
unknown, highly similar to 
acetyl CoA carboxylase 

(alpha subunit) 
unknown, highly similar to 
acetyl-CoA carboxylase 

beta subunit 
unknown, highly similar to 
DNA polymerase III (alpha 
subunit) DnaE 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to X-Pro 

dipeptidase 
Unknown, similar to alanine 
dehydrogenase 
Unknown, similiar to 

unknown protein 
unknown, similar to 
Salmonella enterica PduL 

protein 
unknown, highly similar to 
acetate kinase 
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SeqID n° 763 

SeqID n° 764 
SeqID n° 765 
SeqID n° 766 
SeqID n° 767 
SeqID n° 768 

SeqID n° 769 

SeqID n° 770 

SeqID n° 771 

SeqID n° 772 

SeqID n° 773 
SeqID n° 774 

SeqID n° 775 

SeqID n° 776 

SeqID n° 777 

SeqID n° 778 

SeqID n° 779 
SeqID n° 780 



LM-1 951.1 

LM-1 952.1 
LM-1 953.1 
LM-1 954.1 
LM-1 955.1 
LM-1 957.1 

LM-1958.1 

LM-1 959.1 

LM-1 960.1 

LM-1 961 .2 

LM-1 962.2 
LM-1964.1 

LM-1966.2 

LM-1 968.2 

LM-197.1 

LM-1970.1 

LM-1971.1 
LM-1976.1 



82 

From 1624316 

From 1625505 
From 1626064 
From 1626620 
From 1627812 
From 1628647 

From 1629594 

From 1630751 

From 1631516 

From 1632728 

From 1633933 
From 1635146 

From 1636412 

From 1308351 

From 1187552 

From 1310315 

From 1312857 
From 1317596 



to 1625314 

to 1626002 
to 1626600 
to 1627633 
to 1628615 
to 1629591 

to 1630754 

to 1631503 

to 1632712 

to 1633759 

to 1635144 
to 1636285 

to 1638127 

to 1310318 

to 1187989 

to 1312774 

to 1313324 
to 1319464 



Unknown, weakly similar to 
site specific DNA- 
methyltransferase 
Unknown, similar to thiol 
peroxidases 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

proteases 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
ornithine 
carbamoyltransferase 
unknown, highly similar to 
N-acetylomithine 
aminotransferase 
unknown, highly similar to 
N-acetylglutamate 5- 
phosphotransferase 
unknown, highly similar to 
ornithine acetyltransferase 
and amino-acid 
acetyltransferases 
unknown, similar to N- 
acetylglutamate gamma- 
semialdehyde 
dehydrogenases 
Unknown, similar to thiamin 
biosynthesis protein Thil 
Unknown, similar to iron- 
sulfur cofactor synthesis 

protein nifS 
Unknown, similar to B. 
subtilis negative regulator 
of FtsZ ring formation 
(EzrA) 

unknown, highly similar to 
DNA gyrase-like protein 
(subunit B) 
unknown, similar to 
Salmonella enterica PduK 

protein 
unknown, highly similar to 
DNA gyrase-like protein 
(subunit A) 
unknown, conserved 
hypothetical protein 
unknown; similar to 
acyltransferase (to B. 
subtilis YrhL protein) 
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SeqID n° 781 LM-1978.1 From 1319677 to 1320375 

SeqlDn°782 LM-1979.1 From 1320608 to 1322284 

SeqID n° 783 LM-198.1 From 1187192 to 1187539 

SeqID n° 784 LM-1980.1 From 1322410 to 1323327 

SeqID n° 785 LM-1981.1 From 1323450 to 1323683 

SeqID n° 786 LM-1983.1 From 1323794 to 1325017 

SeqID n° 787 LM-1984.1 From 1325010 to 1326236 

SeqID n° 788 LM-1985.1 From 1326440 to 1326808 

SeqID n° 789 LM-1986.1 From 1326879 to 1328213 

SeqID n° 790 LM-1 988. 1 From 1 328357 to 1 329652 

SeqID n° 791 LM-1989.1 From 1329696 to 1330217 

SeqID n° 792 LM-1991.2 From 1330247 to 1330861 

SeqID n° 793 LM-1992.2 From 1331018 to 1331347 

SeqID n° 794 LM-1995.1 From 1331813 to 1333807 

SeqID n° 795 LM-1996.1 From 1334028 to 1334267 

SeqID n° 796 LM-1997.1 From 1334318 to 1335160 

SeqID n° 797 LM-1998.1 From 1335179 to 1335910 

SeqID n° 798 LM-1999.1 From 1335932 to 1336438 

SeqID n° 799 LM-200.1 From 1 185375 to 1 187195 

Seql D n° 800 LM-2000. 1 From 1 336448 to 1 337752 

SeqID n° 801 LM-2001 .1 From 1337742 to 1338947 



unknown similar to 
glycerophosphodiester 
phosphodiesterase 
Unknown, similar to 
glycerol 3 phosphate 
dehydrogenase 
unknown, similar to diol 
dehydratase-reactivating 

factor small chain 
Unknown, similar to tRNA 
isopentenylpyrophosphate 

transferase 
Unknown, similar to host 
factor- 1 protein 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YnbA protein 
unknown, similar to 
aluminum resistance 
protein and to B. subtilis 
YnbB protein 
Unknown, similar to 
glutamine synthetase 
repressor 
unknown, highly similar to 

glutamine synthetases 
Unknown, similar to arsenic 
efflux pump protein 
unknown, conserved 
hypothetical protein 
unknown, highly similar to 
SOS response regulator 
lexA, transcription 
repressor protein 
unknown, similar to B. 
subtilis YneA protein 
unknown, highly similar to 

transketolase 
unknown, highly similar to 
B. subtilis YneF protein 
unknown 
unknown, weakly similar to 
arginine N- 
methyltransferases 
unknown, similar to E. coli 
YbdM protein 
diol dehydratase- 
reactivating factor large 
subunit 
unknown, similar to E. coli 
YbdN protein 
unknown 
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SeqID n° 802 LM-2002. 1 From 1 338925 to 1 339299 

SeqlDn°803 LM-2003.1 From 1339592 to 1340320 

SeqID n° 804 LM-2004.1 From 1340320 to 1340877 

SeqID n° 805 LM-2005.2 From 1341107 to 1341865 

SeqID n° 806 LM-2006.2 From 290593 to 291228 

SeqID n° 807 LM-2008.1 From 291312 to 293198 

SeqID n° 808 LM-2009.1 From 293212 to 293841 

SeqID n° 809 LM-201.1 From 1 184818 to 1 185330 

SeqID n° 810 LM-2010.1 From 293845 to 295281 

SeqID n° 811 LM-2011.1 From 295401 to 296213 



SeqID n° 812 LM-2012.1 From 296349 to 296852 
SeqID n° 813 LM-2013.1 From 296889 to 297551 
SeqID n° 814 LM-2014.1 From 297810 to 298652 



SeqID n° 815 LM-2015.1 From 298669 to 299490 

SeqID n° 816 LM-2017.1 From 299607 to 300578 

SeqID n° 817 LM-2018.1 From 300617 to 301717 

SeqID n° 818 LM-2019.1 From 302008 to 304140 

SeqID n° 819 LM-202.1 From 1184142 to 1184801 

SeqID n° 820 LM-2020. 1 From 304133 to 304684 



SeqID n° 821 LM-2022.1 From 304942 to 305613 
SeqID n° 822 LM-2023.1 From 305775 to 306554 

SeqID n° 823 LM-2024.1 From 306644 to 307306 



Unknown 
unknown, highly similar to 

uridylate kinases 
unknown, highly similar to 
ribosome recycling factors 

Unknown, similar to 
undecaprenyl diphosphate 
synthase 
Unknown, similar to 
phosphoglycerate mutase 
Unknown, similar to 
transporter 
Unknown 
Unknown, similar to diol 

dehydrase (diol 
dehydratase) gamma 
subunit (pddC) 
unknown, highly similar to 
phospho-beta-glucosidase 

Unknown, conserved 
hypothetical protein similar 
to B. subtilis YxeH protein 
Unknown 
Unknown 
Unknown, C-terminal part 
similar to B. subtilis 

ComEC protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
oxidoreductase 
Unknown, similar to sugar 
ABC transporter, ATP- 
binding protein 
Unknown, highly similar to 
anaerobic ribonucleoside- 
triphosphate reductase 
Unknown, similar to diol 

dehydrase (diol 
dehydratase) gamma 
subunit 
Unknown, highly similar to 
anaerobic ribonucleotide 
reductase activator protein 
Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to ABC 
transporter permease 
protein 
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SeqID n° 824 LM-2026.1 From 307303 to 308319 

SeqID n° 825 LM-2027.2 From 308334 to 309155 

SeqID n° 826 LM-203.1 From 1 182440 to 1 184104 

SeqID n° 827 LM-2035.1 From 2671624 to 267251 1 

SeqID n° 828 LM-2037.1 From 2670042 to 2671523 

SeqID n° 829 LM-2038. 1 From 26691 52 to 2669994 

SeqID n° 830 LM-2039.1 From 2665666 to 2668653 

SeqID n° 831 LM-204.1 From 1 181720 to 1 182421 

SeqID n° 832 LM-2040.1 From 2665190 to 2665666 

SeqID n° 833 LM-2042.1 From 2664343 to 2665128 

SeqID n° 834 LM-2043.1 From 2663619 to 2664296 

SeqID n° 835 LM-2044.1 From 2662243 to 2663622 

SeqID n° 836 LM-2045.1 From 2661076 to 2662164 

SeqID n° 837 LM-2046.1 From 2660408 to 2661076 

SeqID n° 838 LM-2047.1 From 2659852 to 2660145 

SeqID n° 839 LM-2048.2 From 2658966 to 2659841 

SeqID n° 840 LM-2049.3 From 403743 to 404198 

SeqID n° 841 LM-205.1 From 1181338 to 1181625 



SeqID n° 842 LM-2050.1 From 403279 to 403725 
SeqID n° 843 LM-2051 .1 From 402641 to 403060 
SeqID n° 844 LM-2053.1 From 401619 to 402539 



SeqID n° 845 LM-2054.1 From 400997 to 401299 



Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, putative 
lipoprotein 
unknown, highly similar to 
propanediol dehydratase, 
alpha subunit 
Unknwon, similar to 
transcription regulator 
TetR/AcrR family 
Unknwon, similar to drug- 
export proteins 
Unknwon, conserved 
hypothetical proteins 
Unknwon, similar to 
formate dehydrogenase 
alpha chain 
unknown, similar to 
Salmonella typhimurium 

PduB protein 
Unknown, similar to B. 
subtilis YrhD protein 
Unknwon, similar to 
formate dehydrogenase 

associated protein 
unknown, similar to two- 
component response 

regulator 
Unknwon, similar to 
response regulator 

histidine kinase 
Unknown, conserved 
hypothetical protein 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
Unknwon, conserved 
hypothetical protein 
Unknwon 
Unknown 
unknown, similar to 
Salmonella typhimurium 
PduA protein 
Unknown 
Unknown 
Unknown, similar to 
putative transcription 
regulator 
Unknown, similar to PTS 
betaglucoside-specific 
enzyme 1IB component 
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SeqID n° 846 LM-2055.1 From 399645 to 400979 

SeqID n° 847 LM-2056.1 From 398210 to 399652 

SeqID n° 848 LM-2058.1 From 397340 to 398053 

SeqID n° 849 LM-2059.1 From 396964 to 397299 

SeqID n° 850 LM-2060.1 From 396208 to 396927 

SeqID n° 851 LM-2061 .1 From 395602 to 3961 1 1 

SeqID n° 852 LM-2063.1 From 394264 to 395529 

SeqID n° 853 LM-2064.1 From 393086 to 394246 

SeqID n° 854 LM-2065.1 From 391604 to 393052 

SeqID n° 855 LM-2066.1 From 390529 to 391473 

SeqID n° 856 LM-2067.1 From 389813 to 390430 

SeqID n° 857 LM-2068.1 From 388807 to 389541 

SeqID n° 858 LM-2069.1 From 387697 to 388467 

SeqID n° 859 LM-2070.1 From 386780 to 387640 

SeqID n° 860 LM-2072.1 From 385386 to 386780 

SeqID n° 861 LM-2073.1 From 384926 to 385372 

SeqID n° 862 LM-2074.1 From 383710 to 384726 

SeqID n° 863 LM-2076.1 From 382016 to 383536 



Unknown, similar to PTS 
betaglucoside-specific 
enzyme IIC component 
Unknown, similar to beta- 
glucosidase 
Unknown, similar to 
transcription regulator 

(GntR family) 
Unknown, conserved 
hypothetical protein 
Unknown, conserved 
hypothetical protein, highly 
similar to B. subtilisYeel 
protein 
Unknown, similar to 

different proteins 
Unknown, conserved 
hypothetical protein similar 
to B. subtilis YwbN protein 
Unknown, conserved 
hypothetical protein, 
putative lippoprotein 
Unknown, similar to 
conserved hypothetical 
protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
Salmonella typhimurium 
peptidase E 
Unknown, similar to 
conserved hypothetical 
integral membrane protein 

Unknown, similar to 
transcriptional regulator 

(DeoR family) 
Unknown, similar to D- 
fructose-1 ,6-biphosphate 

aldolase 
Unknown, similar to PTS 
system, fructose-specific 
enzyme IIBC component 
Unknown, similar to PTS 
system, enzyme IIA 

component 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
Flavocytochrome C 
Fumarate Reductase chain 
A 
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SeqID n° 864 LM-2077.1 From 380253 to 381779 

SeqID n° 865 LM-2078.1 From 379794 to 380207 

SeqID n° 866 LM-2080.2 From 378964 to 379728 

SeqID n° 867 LM-2082.3 From 2052233 to 2052664 

SeqID n° 868 LM-2083.1 From 2052816 to 2053301 

SeqID n° 869 LM-2084.1 From 2053274 to 2053801 

SeqID n° 870 LM-2088.1 From 2054493 to 2056187 

SeqID n° 871 LM-2089.1 From 2056205 to 2057926 

SeqID n° 872 LM-2091.1 From 2057927 to 2058418 

SeqID n° 873 LM-2092.1 From 2058516 to 2059511 

SeqID n° 874 LM-2097.1 From 2059669 to 2061207 

SeqID n° 875 LM-2098.1 From 2061209 to 2062261 

SeqID n° 876 LM-2 1 . 1 From 2723786 to 2725852 

SeqID n° 877 LM-2100.1 From 2062263 to 2063651 

SeqID n° 878 LM-2101.1 From 2063638 to 2064219 

SeqID n° 879 LM-2102.1 From 2064238 to 2065506 

SeqID n° 880 LM-2104.1 From 2065698 to 2066417 

SeqID n° 881 LM-2106.1 From 2066455 to 2067756 

SeqID n° 882 LM-2107.3 From 2067887 to 2068897 



Unknown, similar to fatty- 
acid-CoA ligase 
. Unknown, similar to 
unknown proteins 
unknown, highly similar to 
regulatory proteins (DeoR 
family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
dihydroxy-acid dehydratase 
Unknown, similar to 
acetolactate synthase 
(acetohydroxy-acid 
synthase) (large subunit) 
Unknown, similar to 
acetolactate synthase 
(acetohydroxy-acid 
synthase) (small subunit) 
Unknown, similar to ketol- 
acid reductoisomerase 
(acetohydroxy-acid 
isomeroreductase) 
Unknown, similar to 2- 
isopropylmalate synthase 
Unknown, similar to 3- 
isopropylmalate 
dehydrogenase 
Unknown, similar to 
transcriptional 
antiterminator 
Unknown, similar to 3- 
isopropylmalate 
dehydratase (large subunit) 
Unknown, similar to 3- 
isopropylmalate 
dehydratase (small 

subunit) 
Unknown, similar to 
threonine dehydratase 
Unknown, similar to alpha- 
acetolactate decarboxylase 
Unknown, similar to 
pyrimidine-nucleoside 

phosphorylase 
Unknown, similar to 
transcription regulators 
(Lad family) 
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SeqlDn°883 LM-2109.2 From 1677409 to 1680009 



SeqlDn°884 LM-2110.1 From 1680132 
SeqlDn°885 LM-2111.1 From 1680644 



SeqID n° 886 

SeqID n° 887 

SeqID n° 888 

SeqID n° 889 
SeqID n° 890 



LM-2113.1 From 1681561 

LM-2114.1 From 1682664 

LM-2115.1 From 1683635 

LM-2116.1 From 1684229 

LM-2119.1 From 1684563 



to 1680560 
to 1681564 

to 1682628 

to 1683638 

to 1684216 

to 1684462 
to 1687265 



SeqID n° 891 LM-2120.1 From 1687420 to 1688223 

SeqID n' 892 LM-2121.1 From 1688248 to 1688670 

SeqID n 6 893 LM-2123.1 From 1688670 to 1691888 

SeqID n° 894 LM-2127.1 From 1691972 to 1695043 

SeqID n° 895 LM-2128.1 From 1695040 to 1696164 

SeqID n 6 896 LM-2129.1 From 1696280 to 1696888 



SeqID n° 897 LM-2130.1 From 1696940 to 1697302 

SeqID n° 898 LM-2131.1 From 1697476 to 1697991 

SeqID n° 899 LM-2133.1 From 1698165 to 1698653 

SeqID n° 900 LM-2134.1 From 1698734 to 1700539 



SeqID n° 901 LM-2135.2 From 1700523 to 1702292 



SeqID n° 902 
SeqID n° 903 
SeqID n° 904 
SeqID n° 905 
SeqID n° 906 
SeqID n° 907 



LM-2 138.1 
LM-214.1 
LM-2140.1 
LM-2141.1 
LM-2142.1 
LM-2 143.1 



From 366508 
From 1177116 
From 367533 
From 367898 
From 369377 
From 370032 



SeqID n" 908 LM-2 1 45. 1 From 370732 
SeqID n° 909 LM-2147.1 From 371 130 



to 367032 
to 1177865 
to 367901 
to 369373 
to 369757 
to 370403 

to 371064 
to 373127 



Unknown, similar to 
Alcohol-acetaldehyde 

dehydrogenase 
Unknown, similar to 
unknown proteins 
Unknown, similar to similar 
to ABC transporter (ATP- 
binding protein) 
Unknown, similar to 
membrane proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to dna-3- 
methyladenine glycosidase 

Unknown 
unknown, highly similar to 
aconitate hydratases 
Unknown, similar to 
putative sigma factor 
regulator 
Unknown 
Unknown, similar to SNF2- 

type helicase 
Unknown, similar to ATP- 
dependent dsDNA 
exonuclease SbcC 
unknown, similar to 
putative exonucleases 
SbcD 

unknown, similar to 1- 
acylglycerol-3-phosphate 
O-acyltransferases 
unknown 
unknown 
unknown, similar to 
hypothetical proteins 
unknown, similar to ABC 
transporter (ATP-binding 

protein) 
unknown, similar to ABC 
transporter (ATP-binding 
protein) 
unknown 
unknown 
Unknown 
Unknown 
Unknown 
Unknown, weakly similar to 
inorganic pyrophosphatase 
Unknown 
Unknown, similar to 
transketolase 
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SeqID n° 910 


LM-2148.1 


From 373129 


to 373785 


SeqID n° 911 


LM-215.1 


From 1176660 


to 1177091 


SeqID n° 912 


LM-2150 1 


rium of oooz 


to 374596 


SeqID n 6 913 


LM-2151.1 


From 374621 


to 375067 


SeqID n* 914 


LM-2152 1 


riuiii ot out *t 


f— 07CMO 

to o75838 


SeqID n° 915 


LM-2154.1 


From 375842 


to 376492 


SeqID n° 916 


LM-2155.1 


From 376514 


to 377509 


SeqID n° 917 
SeqID n° 918 
SeqID n° 919 


LM-2156.1 
LM-2157.2 
LM-2158.2 


From 377531 
From 377888 
From 378404 


to 377872 
to 378319 
to 378781 


SeqID n° 920 


LM-216.1 


From 1176304 


to 1176654 


SeqID n° 921 


LM-2161.2 


From 72408 


to 74222 


SeqID ri° 922 
SeqID n° 923 
SeqID n° 924 
SeqID n" 925 
SeqID n° 926 


LM-2162.1 
LM-2164.1 
LM-217.1 


From 72062 
rrorn / loou 
From 71064 
rrom fvofo 
From 1175602 


to 72394 
to 72061 
to 71363 
to 71071 
to 1176156 


SeqID n° 927 


LM-2170.1 


From 66160 


to 70656 


SeqID n° 928 


LM-2171.1 


From 64950 


to 66146 


SeqID n° 929 


LM-2172.1 


From 64677 


to 64928 


SeqID n° 930 
SeqID n° 931 


LM-2174.1 
LM-2178.1 


From 64144 
From 60948 


to 64659 
to 64154 


SeqID n 8 932 


LM-2179.1 


From 60506 


to 60799 



SeqID n° 933 LM-218.1 From 1 174241 to 1 175605 
SeqID n° 934 LM-2180.2 From 58897 to 60189 
SeqID n° 935 LM-2182.2 From 1814403 to 1815080 



Unknown, similar to 

transaldolase 
unknown, similar to 
Salmonella enterica PduV 
protein 
Unknown, similar to 
dehydrogenase/reductase 
Unknown, similar to sugar- 
phosphate isomerase 
Unknown, similar to 
triosephosphate isomerase 

Unknown, similar to 
dihydnoxyacetone kinase 

Unknown, similar to 
dihydroxyacetone kinase 
Unknown 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
Salnionelle enterica PduU 

protein 
Unknwon, similar to toxin 
components 
Unknwon 
Unknwon 
Unknwon 
unknwon 
Unknown, similar to 
Salmonella enterica PduT 

protein 
Unknwon, highly similar to 
B. subtilis YukA protein 
Unknwon, similar to B. 
subtilis YukC protein 
Unknwon, similar to B. 
subtilis YukD protein 

Unknwon 
Unknwon, similar to B. 
subtilis YueB protein 
Unknwon, similar to a small 

heat shock protein of 
Clostridium acetobutylicum 

unknown, similar to 
Salmonella enterica PduS 

protein 
unknown, highly similar to 
adenylosuccinate 
synthetase 
Unknown, similar to two- 
component response 
regulator 
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SeqID n" 936 

SeqID n° 937 
SeqID n° 938 

SeqID n° 939 



SeqID n° 941 

SeqID n° 942 

SeqID n° 943 
SeqID n° 944 
SeqID n° 945 

SeqID n° 946 

SeqID n° 947 

SeqID n° 948 

SeqID n° 949 

SeqID n° 950 

SeqID n° 951 

SeqID n° 952 

SeqID n° 953 
SeqID n° 954 
SeqID n° 955 
SeqID n° 956 
SeqID n° 957 
SeqID n° 958 

SeqID n° 959 



LM-2184.1 

LM-2 185.1 
LM-2186.1 

LM-2188.1 



LM-2194.1 
LM-2195.1 
LM-2196.1 



LM-22.1 



LM-220.1 



LM-2207.2 
LM-2209.1 
LM-221.1 
LM-2210.1 
LM-2212.1 
LM-2213.1 



From 1813457 

From 1812978 
From 1811219 

From 1810157 



From 1806446 
From 1805925 
From 1805041 



From 2726008 



From 1173643 



From 2505425 
From 2505858 
From 1173144 
From 2506494 
From 2506867 
From 2507306 



to 1814332 

to 1813436 
to 1812961 

to 1811197 
to 1809759 

to 1809107 

to 1808458 

to 1807552 
to 1806443 
to 1805928 

to 1804848 

to 2727195 

to 1174122 

to 1800241 

to 1798727 

to 1797880 

to 1796925 

to 2505835 
to 2506370 
to 1173551 
to 2506835 
to 2507250 
to 2508202 

to 2508816 



LM-2214.1 From 2508247 



SeqID n° 940 LM-2189.1 From 1809118 
LM-2191.1 From 1808460 
LM-2193.1 From 1807643 



LM-2199.1 From 1800256 



LM-2201.1 From 1798772 

LM-2203.1 From 1797897 

LM-2204.1 From 1796987 

LM-2205.2 From 1795681 



Unknown, similar to 
unknown proteins 
Unknown 
unknown, highly similar to 

adenine deaminases 
Unknown, similar to two- 
component sensor histidine 
kinase 

Unknown, similar to amino 
acid (glutamine) ABC 
transporter, permease 
protein 
Unknown, similar to amino 

acid (glutamine) ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to amino 
acid ABC transporter 

(binding protein) 
Unknown, similar to 
glycerol dehydrogenase 
Unknown, similar to 
unknown proteins 
transcription activator of 
glutamate synthase operon 
GltC 

Unknown, similar to 
glutamate synthase (large 

subunit) 
unknown, highly similar to 
translation elongation 
. factor EF-Tu 
unknown, similar to 
uroporphyrin-lll C- 
methyltransferase 
Unknown, similar to 
glutamate synthase (small 

subunit) 
Unknown, similar to sugar 

transport protein 
Unknown, similar to sugar 

transport protein 
Unknown, similar to sugar 
binding protein 
Unknown 
unknown 
unknown 
Unknown 
unknown 
Unknown, similar to 
transcription regulator 
Unknown 
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SeqID n° 960 
oeqiu n yo i 


LM-2215.1 

LIVI-Z^IO. 1 


From 2508992 
hrom 2510361 


to 2509411 
to 2514293 


oeqiu n yo^: 


1 M 9917 1 


rrom i4o0o 


10 2515210 


oeqiu n yoo 


1 M 991 Q 1 


rrom zoioolz 


to 2518587 


ocqiLs n PO*f 


1 M-999H 1 


Prom 9£1flQ71 


to 2518879 




I M-9991 1 


From 9^1 QQQ^ 


to 252Uoyy 


SeqID n" 966 


LM-2222.1 


From 2520415 


to 2522796 


SeqID n° 967 


LM-2224.2 


From 2522830 


to 2523576 


SeqID n° 968 


LM-2225.2 


From 2343056 


to 2343685 


SeqID n° 969 


LM-2226.1 


From 2341865 


to 2343013 


SeqID n° 970 


LM-2227.1 


From 2341064 


to 2341789 


SeqID n° 971 


LM-2228.1 


From 2339611 


to 2341071 


OcqiL* ii sjl c. 


I M 999Q 1 


From 9*V}ft/19n 

rrom zooohzu 


to 2ooy418 


SeqID n° 973 
SeqID n° 974 


LM-223.1 
LM-2230.1 


From 1172797 
From 2337786 


to 1173036 
to 2338406 


SeqID n° 975 


LM-2232.1 


From 2336515 


to 2337399 


SeqID n° 976 


LM-2233.1 


From 2335862 


to 2336518 


SeqID n° 977 


LM-2234.1 


From 2335468 


to 2335860 


SeqID n° 978 


LM-2235.1 


From 2334586 


to 2335455 


SeqID n° 979 


LM-2236.1 


From 2334003 


to 2334569 


SeqID n° 980 


LM-2237.1 


From 2333375 


to 2333857 


SeqID n° 981 


LM-2238.1 


From 2332888 


to 2333283 



unknown 
Unknown, similar to 

glycosidase 
Unknown, similar to 

internalin 
Unknown, similar to 

glycosidase 
Unknown, similar to 
transcription regulator 
Unknown, conserved 
hypothetical protein 
Unknwon, similar to 
exoribonuclease RNase-R 
Unknown, similar to 
carboxylesterase 
Unknown, similar to 
phosphoglucomutase 
Unknown, similar to 
aspartate aminotransferase 
Unknown, similar to amino 
acid ABC transporter (ATP- 

binding protein) 
Unknown, similar to amino 
acid ABC transporter, 
permease protein 
Unknown, similar to low- 
affinity inorganic phosphate 
transporter 
unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
putative ribosomal large 
subunit pseudouridine 

synthase 
Unknown, similar to 
methylphosphotriester- 
DNA alkyltransferase and 
transcriptional regulator 
Unknown, similar to 06- 
methyfguanine-DNA 
methyltransferase 
Unknown, similar to 
transcriptional regulators 
(GntR family) 
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SeqID n° 982 


LM-2239.1 


From 2332020 


to 2332898 


SeqID n° 983 


LM-224.1 


From 1172076 


to 1172648 


SeqID n° 984 


LM-2241.1 


From 2330959 

C rrtm OOOQ7Q4 


to 2332023 
to 2330921 






c rr , m oooooco 

rrom zozoooo 


to 2329571 


SeqID n° 987 


LM-2246.1 


From 2327461 


to 2328330 


OCLjlU II 9UO 




rrom coiXAll 


to 2327402 


SeqID n° 990 


1 M OIK *f 

LM-2250.1 


rrom n 71662 
From 2324554 


to 1172009 
to 2325408 


SeqID n° 991 


LM-2251.1 


From 2323375 


to 2324253 


SeqID n° 992 


LM-2253.1 


From 2322031 


to 2323335 


SeqID n° 993 


LM-2254.2 


From 2320906 


to 2321775 


SeqID n° 994 


LM-2255.2 


From 2320416 


to 2320841 


SeqID n° 995 
SeqID n° 996 


LM-2256.3 
LM-2257.3 


From 1405578 
From 1404714 


to 1405859 
to 1405541 


SeqID n° 997 


LM-2258.3 


From 1403227 


to 1404678 


SeqID n° 998 


LM-2260.1 


From 1402550 


to 1403230 


SeqID n° 999 


LM-2261.2 


From 1400941 


to 1402359 


SeqID n° 1000 


LM-2262.2 


From 1399699 


to 1400796 


SeqID n° 1001 


LM-2263.1 


From 1398077 


to 1399327 


SeqID n° 1002 


LM-2266.1 


From 1397063 


to 1398046 



Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ATP- 
dependent Clp protease 
proteolytic component 

Unknown 
Unknown, similar to 
transport system permease 
protein 
Unknown, similar to 
transport system permeate 
protein 
Unknown, similar to 
oxidoreductase 
Unknown, similar to NADH 
oxidase 
unknwon 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcriptional regulators 
(LysR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
arsenate reductase 
Unknown 
Unknown, similar to B. 
subtilis Spoil I J protein 
two-component sensor 

histidine kinase 
two-component response 
regulator 
Unknown, similar to 6- 
phosphogluconate 

dehydrogenase 
Unknown, similar to 
aminotripeptidase. 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E2 
subunit (lipoamide 
acyltransferase) 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E1 
subunit (2-oxoisovalerate 
dehydrogenase beta 
subunit) 
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SeqlDn°1003 LM-2267.1 From 1396051 to 1397046 



SeqlDn°1004 LM-2269.1 From 1394599 to 1396026 

SeqlDn°1005 LM-2270.1 From 1393517 to 1394584 

SeqlDn°1006 LM-2271.1 From 1392514 to 1393380 

SeqlDn°1007 LM-2273.1 From 1390694 to 1392385 

SeqID n° 1008 LM-2274.1 From 1390222 to 1390671 

SeqID n° 1009 LM-2275.1 From 1389207 to 1390031 

SeqID n° 1010 LM-2276.1 From 1387396 to 1389210 

SeqID n° 101 1 LM-2278.1 From 1385934 to 1386815 

SeqID n° 1012 LM-2280.1 From 1384362 to 1385714 

SeqID n° 1013 LM-2283.1 From 1383489 to 1384343 



SeqID n° 1014 LM-2284.1 From 1383007 to 1383393 

SeqID n° 1015 LM-2285.1 From 1382569 to 1382976 

SeqID n° 1016 LM-2287.3 From 1381201 to 1382565 

SeqID n° 1017 LM-2289.3 From 1380720 to 1381187 

SeqID n° 1 01 8 LM-2290.3 From 1 380004 to 1 380561 

SeqID n° 1019 LM-2291.2 From 1275845 to 1276693 



Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E1 
subunit (2-oxoisovalerate 
dehydrogenase alpha 

subunit) 
Unknown, similar to 
branched-chain alpha-keto 
acid dehydrogenase E3 
subunit 
Unknown, similar to 
branched-chain fatty-acid 
kinase 
Unknown, similar to 
phosphotransbutyrylase 
DNA repair and genetic 
recombination 
Unknown, similar to 
arginine repressor 
Unknown, conserved 
hypothetical protein 
Unknown, similar to D-1- 
deoxyxylulose 5-phosphate 
synthase 
Unknown, similar to 
geranyltranstransferase 

Unknown, similar to 
exodeoxyribonuclease VII 

(large subunit) 
unknown, highly similar to 
methylenetetrahydrofolate 

dehydrogenase and 
methenyltetrahydrofolate 
cyclohydrolase 
Unknown, similar to 
transcription termination 

protein (NusB) 
unknown, similar to B. 
subtilis YqhY protein 
acetyl-CoA carboxylase 
subunit (biotin carboxylase 

subunit) 
Unknown, similar to acetyl- 
CoA carboxylase subunit 
(biotin carboxyl carrier 
subunit) 
unknown, highly similar to 
elongation factor P (EF-P) 
unknown, similar to B. 
subtilis YxkD protein 
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SeqIDn°1020 LM-2292.1 From 1274883 to 1275542 



SeqID n° 1021 LM-2293.1 From 1273473 to 1274678 



SeqlDn° 1022 LM-2294.1 From 1273231 to 1273476 

SeqID n° 1023 LM-2295.1 From 1272717 to 1273193 

SeqID n° 1024 LM-2296.1 From 1272286 to 1272549 

SeqID n° 1025 LM-2298.1 From 1270849 to 1272261 



SeqID n° 1026 LM-2299.1 From 1269834 to 1270433 



SeqID n° 1027 LM-230.1 From 1 168390 to 1 169541 

SeqID n° 1028 LM-2300.1 From 1269262 to 1269669 

SeqID n° 1029 LM-2301.1 From 1268515 to 1269108 

SeqID n° 1030 LM-2302.1 From 1267115 to 1268473 

SeqID n° 1031 LM-2303.1 From 1266040 to 1266564 



SeqID n° 1032 LM-2304.1 From 1265392 to 1266003 

SeqID n° 1033 LM-2305.1 From 1264642 to 1265388 

SeqID n° 1034 LM-2306.1 From 1263829 to 1264629 

SeqID n° 1035 LM-2307.1 From 1263187 to 1263681 

SeqID n° 1036 LM-2308.1 From 1261917 to 1263131 

SeqID n° 1037 LM-231.1 From 1168021 to 1168368 

SeqID n° 1038 LM-2310.1 From 1259917 to 1261728 

SeqID n° 1039 LM-2312.1 From 1259530 to 1259841 

SeqID n° 1040 LM-2315.1 From 1257092 to 1259449 

SeqID n° 1041 LM-2317.1 From 1255357 to 1257069 

SeqID n° 1042 LM-2318.3 From 1254722 to 1255264 



unknown, similar to 
regulator of the Fnr CRP 
family (including PrfA) 
unknown; similar to 
antibiotic resistance protein 

unknown 
unknown, weakly similar to 
8-oxo-dGTPase (mutT) 

unknown 
Unknown, similar to ATP- 
dependent RNA helicase 

(DEAD motif) 
unknown weakly similar to 
phosphoglycerate mutase 
1 

unknown 
unknown 
unknown, similar to B. 
subtilis Ydel protein 

unknown 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis YsnB protein 

unknown, conserved 
hypothetical protein, similar 
to B. subtilis YsnA protein 

Unknown, similar to 

ribonuclease PH 
Unknown, similar to 
glutamate racemase 
unknown, similar to B. 
subtilis YsIB protein 
Unknown, similar to 
aspartokinase II alpha 
subunit 
unknown, similar to 
regulatory proteins 
unknown, highly similar to 
excinuclease ABC subunit 
C 

thioredoxin 
unknown, similar to MutS 

protein (MutS2) 
unknown, similar to DNA 
polymerase beta, to B. 
subtilis YshC protein 
unknown, similar to B. 
subtilis YshB protein 
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SealD n° 104^ 




rrom o4/ooo 


to 348428 


SeqID n° 1044 


LM-232.1 


From 1167645 


to 1167953 


SeqID n° 1045 


LM-2320.1 


From 347219 


to 347611 


SeqID n° 1046 


LM-2322.1 


From 346377 


to 347036 


SeqID n° 1047 


LM-2324.1 


From 343221 


to 344636 


SeqID n° 1048 


LM-2325.1 


From 342551 


to 343195 


SealD n° 1049 


l— IVI £.\JC. 1 . f 


rrom ohI /ol 


to 342554 


SeqID n° 1050 


LM-2329.1 


From 340945 


to 341754 


SeqID n° 1051 


LM-2332.1 


From 340278 


to 340952 


SeqID n° 1052 


LM-2333.1 


From 339270 


to 340055 


SeqID n° 1053 


LM-2334.1 


From 338341 


to 339087 


SeqID n° 1054 


LM-2335.2 


From 337161 


to 338363 


SealD n° 1055 
SeqID n° 1056 
SeqID n° 1057 


1— IVI tJOU.O 

LM-2338.2 
LM-2339.1 


rrom oooo4l 
From 1026871 
From 1026376 - 


to 337161 
to 1029045 
to 1026855 


SeqID n° 1058 


LM-234.1 


From 1165962 


to 1167608 


SeqID n° 1059 


LM-2340.1 


From 1025159 


to 1026190 


SeqID n° 1060 
SeqID n° 1061 


LM-2341.1 
LM-2342.1 


From 1024761 
From 1023346 


to 1025129 
to 1024680 



SeqID n° 1062 LM-2344.1 From 1022535 to 1023302 

SeqID n° 1063 LM-2345.1 From 1021665 to 1022432 

SeqID n° 1064 LM-2346.1 From 1020022 to 1021383 

SeqID n° 1065 LM-2347.1 From 1019545 to 1019970 



Unknown, similar to 
unknown proteins 
unknown, similar to B. 
subtilis YjcS protein 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phospho-beta-glucosidase 
Unknown, similar to 
thiamin-phosphate 
pyrophosphorylase (ThiE) 

Unknown, similar to 
phosphomethylpyrimidine 
kinase (ThiD) 
Unknown, similar to 
hydroxyethylthiazole kinase 
(ThiM) 

Unknown, similar to thiamin 
biosynthesis protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
unknown proteins 
Unknown 
ATP-dependent protease 

unknown, similar to 
methylated-DNA-protein- 
cystein methyltransferase 
unknown, similar to ABC 
transporters, ATP-binding 
proteins (HI0664) 
unknown, similar to B. 
subtilis YkrP protein 
unknown 
unknown, similar to Na+- 
transporting ATP synthase 
subunit J 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, similar to 
regulatory proteins (MarR 
family) 
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SeqIDn 0 1066 LM-2348.1 From 1017870 to 1019438 unknown, similar to peptide 

chain release factor 3 (RF- 

SeqlDn°1067 LM-2349.1 From 1016934 to 1017788 unknown^similar to 

Streptococcus agalactiae 

SeqIDn 0 1068 LM-235.2 Froml 164245 to 1165960 ATP-bfndmTSnsport 

SeqIDn 0 1069 LM-2350.1 From 1016030 to 1016941 unkno^similar to 

antibiotic ABC transporter, 

SeqIDn 0 1070 LM-2351.1 From 1015605 to 1016033 ATP " b unkn n o^l ^0te,n, 
SeqIDn 0 1071 LM-2352.1 From 1015153 to 1015608 unknown, weakly similar to 

two-component response 
„ regulator 
SeqIDn 0 1072 LM-2353.1 From 1014540 to 1015019 unknown, similar to 

SeqIDn 0 1073 LM-2355.1 From 1013475 to 1014527 9 'unkn^ 

o rt glucanase and peptidase 

SeqIDn 1074 LM-2358.1 From 1011860 to 1013335 unknown, similar to efflux 

SeqIDn 0 1075 LM-2359.1 From 1011074 to 1011826 unknowS^^ 

transporter transmembrane 
o .r. 3 component 
SeqIDn 1076 LM-236.2 From 1163277 to 1163996 unknown, similar to 

^ „ ^ transcription regulators 

SeqIDn 0 1077 LM-2361.1 From 1010313 to 1011077 unknown, similar to 

daunorubicin resistance 
^ ATP-binding proteins 

SeqIDn 0 1078 LM-2362.1 From 1009098 to 1010117 unknown; similar to 

branched-chain amino acid 
_ o aminotransferase 
SeqIDn 1079 LM-2363.1 From 1008130 to 1008879 unknown, similar to B. 

SeqIDn 0 1080 LM-2364.1 From 1007669 to 1008112 unknot 
o .r^ 0 .^. ■ subtilis YjcF protein 

SeqIDn 0 1081 LM-2365.2 From 1006959 to 1007633 unknown, similar to ribose 
„ ,^ n „ 5-phosphate isomerase 

SeqIDn 0 1082 LM-2366.2 From 1937783 to 1938415 unknown, similar to 

hemolysinlll proteins, 
putative integral membrane 
protein 

SeqIDn 0 1083 LM-2368.1 From 1938538 to 1939155 unknown, similar to 

conserved hypothetical 
proteins 

SeqIDn 0 1084 LM-2369.1 From 1939168 to 1939980 unknown, similar to 

conserved hypothetical 
proteins 

SeqIDn" 1085 LM-237.1 From 1162666 to 1163280 Unknown, similar to 
o ir^ » „„„„ unknown proteins 

SeqIDn" 1086 LM-2371.1 From 1939999 to 1942638 unknown, similar to 

pyruvate phosphate 
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SeqlDn°1087 LM-2372.1 From 1942687 to 1943064 



SeqlDn°1088 LM-2373.1 From 1943068 to 1944051 



SeqID n° 1089 

SeqID n° 1090 

SeqID n° 1091 

SeqID n° 1092 
SeqID n° 1093 

SeqID n° 1094 

SeqID n° 1095 



LM-2375.1 From 1944145 to 1944768 

LM-2378.1 From 1944908 to 1946419 

LM-2379.1 From 1946488 to 1947363 

LM-238. 1 From 1 1 61689 to 1 1 62696 

LM-2380.1 From 1947416 to 1947898 

LM-2381.1 From 1947914 to 1948858 

LM-2385.1 From 1948871 to 1950763 



SeqID n° 1096 LM-2386.1 From 1950883 to 1951305 



SeqID n° 1097 LM-2387.2 From 1951284 to 1952564 



SeqID n° 1098 LM-2388.2 From 1952721 to 1953149 

SeqID n" 1099 LM-239.1 From 1161220 to 1161675 
SeqID n° 1100 LM-2390.2 From 148354 to 150009 



SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 



n°1101 
n° 1102 
n°1103 
n°1104 
n°1105 
n°1106 
n°1107 
n°1108 



LM-2391.1 
LM-2392.1 
LM-2393.1 
LM-2394.1 
LM-2395.1 
LM-2397.1 
LM-2398.1 
LM-24.1 



From 147884 
From 147399 
From 146764 
From 146306 
From 145869 
From 144397 
From 144097 
From 2727304 



to 148291 
to 147764 
to 147381 
to 146656 
to 146306 
to 144840 
to 144273 
to 2729391 



SeqID n° 1109 LM-240.1 From 1160778 to 1161230 

SeqID n° 1110 LM-2400.1 From 143366 to 143722 

SeqID n° 1111 LM-2401.1 From 141618 to 143006 

SeqID n° 1112 LM-2402.1 From 141337 to 141705 

SeqID n° 1113 LM-2404.1 From 141052 to 141336 



dikinase 

unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
conserved hypothetical 
proteins, putative integral 

membrane protein 
unknown; similar to alkaline 
phosphatase 
unknown, similar to 
phosphoglucomutases 
unknown, similar to 
methyltransferases 

unknown 
unknown, similar to 
dihydrofolate reductases 
unknown, similar to 
thymidylate synthase 
unknown, similar to 
putative ABC transporters 

(ATP-binding protein) 
Unknown, similar to formyl- 
tetrahydrofolate synthetase 

C-terminal part 
Unknown, similar to formyl- 
tetrahydrofolate synthetase 
N-terminal part 
unknown, similar o 
transcriptional regulators 

unknown 
Unknown, similar to ABC 
transporter oligopeptide- 
binding protein 
Unknwon 
unknwon 

unknown 

Unknwon 

Unknown 

unknown 
unknown, highly similar to 
translation elongation 

factor G 
unknown, similar to E. coli 
YjaB protein 

unknown 

Unknwon 

Unknwon 

Unknwon 
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SeqID n° 1 1 14 LM-2406.1 From 139960 to 140853 



SeqlDn°1115 LM-2409.1 From 138999 to 139949 



SeqID n° 1 1 16 LM-241 .2 From 1 160220 to 1 160753 
SeqID n° 1117 LM-2411.2 From 137323 to 138897 



SeqID n° 1118 

SeqID n° 1119 

SeqID n° 1120 

SeqID n°1121 
SeqID n°1122 



LM-2412.2 

LM-2414.1 

LM-2415.1 

LM-2416.1 
LM-2417.1 



From 2164106 

From 2166012 

From 2167353 

From 2168054 
From 2169312 



to 2165377 

to 2167343 

to 2167943 

to 2169097 
to 2170526 



SeqID n° 1 123 LM-2418.1 From 2170530 to 2171900 



SeqID n° 1124 
SeqID n°1125 



LM-242.2 
LM-2421.1 



From 1159393 
From 2172068 



to 1159830 
to 2173591 



SeqID n° 1126 LM-2423.1 From 2173836 to 2174486 

SeqID n° 1127 LM-2424.1 From 2174488 to 2175420 

SeqID n° 1128 LM-2426.1 From 2175438 to 2176787 

SeqID n° 1129 LM-2428.1 From 2176815 to 2177090 

SeqID n° 1130 LM-2429.1 From 2177096 to 2177563 

SeqID n° 1 131 LM-243.1 From 1 158930 to 1 159403 

SeqID n° 1132 LM-2431.1 From 2177571 to 2179577 

SeqID n° 1133 LM-2432.2 From 2179758 to 2181203 



SeqID n° 1134 LM-2433.2 From 161420 to 162250 
SeqID n° 1 135 LM-2439.1 From 155993 to 156805 



Unknwon, similar to 
oligopeptide ABC 
transporter, permease 
protein 
Unknwon, similar to 
oligopeptide ABC 
transporter, permease 
protein 
unknown 
Unknwon, similar to 
oligopeptide ABC transport 
system substrate-binding 

proteins 
Unknown, weakly similar to 
transcription regulators 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulators 
Unknown, similar to lipases 

Unknown, similar to 
argininosuccinate synthase 
Unknown, similar to 
argininosuccinate lyase 
unknown 
glycine betaine transporter 
BetL 

Unknown, similar to L- 
fuculose-phosphate 
aldolase 
Unknown, similar to 1- 
phosphofructokinase 
Unknown, similar to PTS 
system galactitol-specific 
enzyme IIC component 
Unknown, similar to PTS 
system galactitol-specific 
enzyme I IB component 
Unknown, similar to PTS 
system galactitol-specific 
enzyme HA component 
unknown 
Unknown, similar to 
transcription antiterminator 
Unknown, similar to 
transcriptional regulator 

(GntR family) and to 
aminotransferase (MocR- 
like) 
unknwon 
Unknwown, conserved 
hypothetical protein 
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SeqlDn°1136 LM-244.1 From 1158425 to 1158844 

SeqlDn°1137 LM-2441.1 From 153608 to 155947 

SeqlDn°1138 LM-2442.1 From 152648 to 153328 

SeqlDn°1139 LM-2444.1 From 151839 to 152645 



SeqID n° 1140 LM-2446.1 From 151186 to 151890 



SeqlDn°1141 LM-2447.2 From 150232 to 151173 



SeqlDn°1142 LM-2448.2 From 2635167 to 2637920 

SeqID n° 1143 LM-245.1 From 1158020 to 1158388 

SeqlDn°1144 LM-2450.1 From 2637964 to 2639562 

SeqlDn°1145 LM-2452.1 From 2639930 to 2640466 



SeqlDn°1146 LM-2453.1 From 2640832 to 2642502 

SeqlDn°1147 LM-2455.1 From 2642499 to 2642948 

SeqID n° 1148 LM-2456.1 From 2643026 to 2643679 

SeqID n° 1149 LM-2457.1 From 2643973 to 2645295 



SeqID n° 1150 LM-2458.1 From 2645310 

SeqlDn°1151 LM-2459.1 From 2646686 

SeqID n° 1 1 52 LM-246. 1 From 1 157592 

SeqlDn°1153 LM-2460.1 From 2647164 



SeqID n° 1 154 
SeqID n° 1155 

SeqID n" 1156 

SeqID n° 1157 

SeqID n° 1158 
SeqID n°1159 

SeqID n' 1160 
SeqID n° 1161 

SeqID n°1162 
SeqID n°1163 



LM-2461.1 
LM-2462.1 

LM-2463.1 

LM-2464.1 

LM-2465.1 
LM-2467.2 

LM-2469.1 
LM-247.3 

LM-2470.1 
LM-2473.1 



From 2648960 
From 2649598 

From 2650231 

From 2650758 

From 2651853 
From 2652174 

From 331543 
From 1 156384 

From 331063 
From 329923 



to 2646146 
to 2647009 
to 1158008 
to 2648825 



to 2649574 
to 2650230 

to 2650755 

to 2651756 

to 2652125 
to 2653085 

to 332688 
to 1157241 

to 331446 
to 330999 



unknown 
Unknwon, similar to ATP 
dependent helicase 
Unknwon 
Unknwon, similar to high- 
affinity zinc ABC 
transporter (membrane 

protein) 
Unknwon, similar to high- 
affinity zinc ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to a 
probable high-affinity zinc 
ABC transporter (Zn(ll)- 
binding lipoprotein) 
autolysin, amidase 
unknown 
unknown, highly similar to 
CTP synthases 
Unknown, similar to B. 
subtilis RNA polymerase 

delta subunit 
arginyl tRNA synthetase 
Unknwon 
Unknwon, conserved 
hypothetical protein 
Unknown, conserved 
hypothetical protein 
Unknwon 
Unknown 
unknown 
Unknwon, similar to 
dipeptide ABC transporter 
(dipeptide-binding protein) 
Unknwon 
Unknwon, similar to 
nicotinamidase 
Unknwon, similar to Chain 
A, Dihydrofolate Reductase 
Unknown, similar to zinc- 
binding dehydrogenase 
Unknwon 
Unknwon, similar to cation 
transport protein 
Unknown 
unknown, similar to 
methylases 
Unknown 
Unknown, similar to low 
specificity L-allo-threonine 
aldolase 
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SeqlDn°1164 LM-2474.1 From 328701 to 329966 

SeqID n° 1 165 LM-2475. 1 From 327941 to 328495 

SeqlDn°1166 LM-2476.1 From 327621 to 327905 

SeqID n° 1 167 LM-2477.1 From 327113 to 327454 



SeqID n° 1 168 LM-2478.1 From 325729 to 327120 



SeqID n° 1 169 LM-2480. 1 From 325422 to 325712 



SeqID n° 1 170 LM-2482.1 From 324104 to 325417 



SeqID n° 1171 LM-2483.1 From 322134 to 324005 



SeqID n° 1172 LM-2485.1 From 321490 to 322128 
SeqID n° 1173 LM-2486.1 From 320542 to 321279 



SeqID n° 1 174 LM-2487.1 From 319560 to 320423 



SeqID n° 1175 LM-2488.2 From 319049 to 319528 



SeqID n° 1176 LM-2489.2 From 2118812 to 2119786 



SeqID n° 1 177 LM-249.2 From 2788504 to 2790243 



SeqID n° 1178 LM-2490.1 From 2117304 to 2118671 



SeqlDn°1179 LM-2491.1 From2116216 to2117307 



SeqID n° 1180 LM-2493.1 From 2115381 to 2116193 



Unknown 
Unknown, putaive 
secreted, lysin rich protein 

Unknown 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme MA component 
Unknown, similar to 
phospho-beta-glucosidase 
and phospho-beta- 
galactosidase 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme IIB component 
Unknown, similar to PTS 
beta-glucoside-specific 
enzyme IIC component 
Unknown, similar to 
transcriptional 
antiterminator (BgIG family) 

Unknown 
Unknown, similar to FMN- 
containing NADPH-linked 
nitro/flavin reductase 
Unknown, similar to 
transcription regulator 

LysR-gltR family 
Unknown, conserved 
hypothetical protein, highly 
similar to B. subtilis YydA 
proteinYyd 
Unknown, similar to 
phospho-N- 
acetylmuramoyl- 
pentapeptide transferase 
unknown, highly similar to 
ABC transporter (ATP- 
binding protein) required 
for expression of 
cytochrome BD 
Unknown, similar to UDP- 
N-acetylmuramoylalanine 
D-glutamate ligase 
Unknown, similar to 
peptidoglycan synthesis 
enzymes, putative 

phospho-N- 
acetylmuramoyl- 
pentapeptide-transferase 
Unknown, similar to cell- 
division initiation protein 
divlB 
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SeqlDn°1181 LM-2495.1 
SeqIDn 0 1182 LM-2496.1 

SeqIDn 0 1183 LM-2497.1 

SeqIDn 0 1184 LM-2499.1 

SeqIDn 0 1185 LM-25.1 
SeqIDn 0 1186 LM-2500.1 

SeqIDn 0 1187 LM-2501.1 

SeqIDn 0 1188 LM-2506.1 

SeqIDn 0 1189 LM-2507.1 

SeqIDn 0 1190 LM-2508.2 
SeqIDn 0 1191 LM-2509.2 

SeqIDn 0 1192 LM-251.1 

SeqIDn 0 1193 LM-2510.2 

SeqIDn 0 1194 LM-2511.1 
SeqIDn 0 1195 LM-2512.1 

SeqIDn 0 1196 LM-2514.2 
SeqIDn 0 1197 LM-2515.2 

SeqIDn 0 1198 LM-2517.1 
SeqIDn 0 1199 LM-2518.1 
SeqIDn 0 1200 LM-252.1 

SeqIDn 0 1201 LM-2520.1 



From 21 1 3734 to 21 1 5014 
From 21 12492 to 21 1 3667 

From 21 1 1684 to 21 12373 

From 21 1 1222 to 21 1 1680 

From 2729457 to 2729927 
From 211 0909 to 21 1 11 99 

From 21 1 0009 to 21 1 0785 

From 2104901 to 2106001 

From 2104059 to 2104904 

From 2102608 to 2104062 

From 2101371 to 2102477 

From 2790243 to 2791 967 

From 2034693 to 2035517 

From 2035507 to 2036505 

From 2036521 to 2037141 

From 20371 41 to 2037956 

From 2037953 to 2038840 

From 2039441 to 2039998 

From 2040274 to 2040933 

From 279 1 967 to 2792980 

From 204091 1 to 20421 10 



unknown, highly similar to 
cell-division protein FtsA 
unknown, highly similar to 
cell-division initiation 

protein FtsZ 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
ribosomal protein S7 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
quinolinate synthetase 
Unknown, similar to 
nicotinate-nucleotide 
pyrophosphorylase 
Unknown, similar to L- 
aspartate oxidase 
Unknown, similar to a NifS- 
like protein required for 
NAD biosynthesis 
unknown, highly similar to 
ABC transporter required 
for expression of 
cytochrome BD 
Unknown, similar to 
ferrichrome ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 

oxidoreductases 
Unknown, similar to 
transcription regulators 
(TetR family) 
Unknown, similar to 
unknown proteins 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
cytochrome D ubiquinol 

oxidase subunit II 
Unknown, similar to toxic 
ion resistance proteins 
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SeqlDn°1202 LM-2521.1 From 2042157 
SeqID n° 1203 LM-2522.1 From.2042913 

SeqlDn°1204 LM-2523.1 From 2043540 

SeqlDn°1205 LM-2524.1 From 2044481 
SeqlDn°1206 LM-2527.1 From 2046046 

SeqlDn°1207 LM-2528.1 From 2046507 

SeqlDn°1208 LM-2529.1 From 2047444 

SeqlDn°1209 LM-2530.1 From 2048623 
SeqlDn°1210 LM-2531.1 From 2049419 
SeqlDn°1211 LM-2532.4 From 2050639 

SeqID n° 1212 LM-2533.2 From 1731955 
SeqlDn°1213 LM-2536.1 From 1732926 

SeqlDn°1214 LM-2538.1 From 1734776 
Seq!Dn°1215 LM-2539.1 From 1735941 

SeqID n° 1216 LM-254.1 From 2792967 

SeqlDn°1217 LM-2540.2 From 1737087 

SeqID n° 1218 LM-2542.2 From 2088797 
SeqID n° 1219 LM-2543.1 From 2087489 
SeqID n° 1220 LM-2544.1 From 2086817 



to 2042900 unknown, similar to 

creatinine amidohydrolases 
to 2043521 Unknown, similar to 2-keto- 
3-deoxygluconate-6- 
phosphate aldolase 
to 2044457 Unknown, similar to 
putative 
phosphotriesterase related 
proteins 

to 2045749 Unknown, similar to 
unknown proteins 
to 2046489 Unknown, similar to PTS 
system enzyme II A 
component 
to 2047256 Unknown, similar to 
transcription regulators, 
(GntR family) 
to 2048514 Unknown, similar to E. coli 
DNA-damage-inducibile 
protein dinP 
to 2049414 Unknown, similar to 

oxidoreductase 
to 2050339 Unknown, similar to 
unknown proteins 
to 20521 14 Unknown, similar to 
glucose-6-phosphate 1- 
dehydrogenase 
to 1 732893 unknown, similar to 

menaquinone biosynthesis 
proteins 

to 1734779 unknown, similar to 5- 
methyltetrahydrofolate- 

homocysteine 
methyltransferase (metH) 
to 1735948 unknown; similar to 

cystathionine beta-lyase 
to 1737065 unknown, similar to 
cystathionine gamma- 
synthase 

to 2794373 unknown, highly similar to 
cytochrome D ubiquinol 
oxidase subunit I 
to 1 739384 unknown, similar to 
cobalamin-independent 
methionine synthase 
to 2091454 Unknown, similar to 

putative sugar hydrolases 
to 2088793 Unknown, similar to 
unknown proteins 
to 2087428 unknown, similar to 
unknown proteins 
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SeqID n° .1221 LM-2545.1 From 2085021 to 2086760 

SeqlDn°1222 LM-2546.1 From 2083537 to 2085021 

SeqID n° 1223 LM-2547.1 From 2082495 to 2083424 

SeqID n° 1224 LM-2548.1 From 2081505 to 2082476 

SeqID n° 1225 LM-2549.1 From 2080027 to 2081484 

SeqID n° 1226 LM-255.1 From 2794755 to 2795225 

SeqID n° 1227 LM-2553.1 From 2078132 to 2079829 

SeqID n° 1228 LM-2554.3 From 2077019 to 2078014 

SeqID n° 1229 LM-2557.2 From 634479 to 635279 



SeqID n° 1230 LM-2559.2 From 633709 to 634251 
SeqID n° 1231 LM-2560.1 From 63281 1 to 633581 



SeqID n° 1232 LM-25611 From 631045 to 63281 1 



SeqID n° 1233 LM-2562.1 
SeqID n° 1234 LM-2563.1 



From 630629 
From 629088 



to 631042 
to 630491 



SeqID n° 1235 LM-2565.1 From 626506 to 628971 



SeqID n° 1236 LM-2566.1 

SeqID n° 1237 LM-2567.2 

SeqID n° 1238 LM-2568.3 

SeqID n° 1239 LM-2570.1 



From 625463 
From 624682 

From 702088 

From 701247 



to 626488 
to 625395 

to 702954 

to 702062 



SeqID n° 1240 LM-2571.1 From 700876 to 701184 
SeqID n° 1241 LM-2573.1 From 700537 to 700821 



unknown, similar to two- 
component sensor histidine 

kinase 
•unknown, similar to two- 
component response 

regulator 
Unknown, similar to 
putative transport system 
integral membrane protein 
Unknown, similar to ABC 
transporter, permease 
protein 
Unknown, similar to 
putative sugar-binding 

lipoproteins 
Unknown, conserved 
hypothetical proteins 
Unknown, similar to alpha- 
acetolactate synthase 

protein, AIsS 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
transport proteins 
(formate?) 
Unknown 
Unknown, similar to 
unknown membrane 

proteins 
Unknown, similar to a 
fusion of two types of 
conserved hypothetical 
proteinconserved 
hypothetical 

Unknown 
Unknown, similar to DNA 
photolyase 
Unknown, putative 
secreted protein 

Unknown 
Unknown, putative . 
secreted protein 
Unknown, conserved 
hypothetical proteins 
unknown, highly similar to 
phosphomethylpyrimidine 
kinase thiD 
Unknown, similar to 
unknown proteins 
unknown, similar to 
transposases 
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SeqID n° 1242 


LM-2574.1 


From 699410 


to 700306 


SeqID n° 1243 


LM-2575.1 


From 698785 


to 699420 


SeqID n° 1244 
SeqID n° 1245 


LM-2577.1 
LM-2578.1 


From 698317 
From 697634 


to 698733 
to 698197 


SeqID n° 1246 


LM-2579.1 


From 696883 


to 697590 


SeqID n° 1247 


LM-258.1 


From 2795420 


to 2796997 


SeqID n° 1248 
SeqID n° 1249 


LM-2581.1 
LM-2583.1 


From 695496 
From 694973 


to 696416 
to 695509 


SeqID n° 1250 


LM-2584.1 


From 694317 


to 694964 


SeqID n° 1251 


LM-2587.1 


From 691581 


to 694271 


oeqiu n 1252 


LM-2589.1 


From 690897 


to 691538 


beqiu n 1253 


LM-259.1 


From 2797039 


to 2797758 


oeqiu n 1254 


LM-2590.1 


From 689923 


to 690873 


SeqID n° 1255 
SeqID n° 1256 


LM-2592.1 
LM-2593.1 


From 689541 
From 688619 


to 689825 
to 689458 


SeqID n° 1257 


LM-2594.2 


From 687138 


to 688529 


SeqID n° 1258 


LM-2597.3 


From 1113759 


to 1115630 


SeqID n° 1259 


LM-2598.3 


From 1115647 


to 1116513 


Qonin ri° H OCA 

oeqiu n i^ou 


LM-2599.1 


From 1116532 


to 1117092 


SeqID n° 1261 
SeqID n° 1262 


LM-26.1 
LM-260.1 


From 2729958 
From 2797838 


to 2730371 
to 2798572 


SeqID n° 1263 


LM-2600.1 


From 1117093 


to 1118079 


SeqID n° 1264 


LM-2601.1 


From 1118082 


to 1118912 


SeqID n° 1265 


LM-2602.1 


From 1118992 


to 1121022 



Unknown, similar to 
transcription regulator (Rgg 
type) 

Unknown, conserved 
hypothetical protein 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
phosphoprotein 
phosphatases 
Unknown, similar to 
acetate-CoA ligase 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulator 
Unknown, conserved 
membrane protein 
Unknown, similar to 
transcription regulators 
Unknown, similar to 
glucosamine-6-phosphate 
isomerase 
Unknown, similar to 
membrane proteins 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to amino 
acid transporter 
unknown, similar to B. 
subtilis minor teichoic acids 
biosynthesis protein GgaB 

Unknown, similar to 
glucose-1-phosphate 
thymidyl transferase 
Unknown, similar to dTDP- 
sugar epimerase 
ribosomal protein S12 
Unknown, similar to merR- 
family transcriptional 
regulator 
Unknown, similar to dTDP- 
D-glucose 4,6-dehydratase 
unknown, similar to DTDP- 

L-rhamnose synthetase 
unknown, similar to teichoic 
acid biosynthesis protein B 
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SeqlDn°1266 LM-2603.1 From 1121098 to 1121808 

SeqlDn°1267 LM-2604.1 From 1121805 to 1122830 

SeqlDn°1268 LM-2606.1 From 1122918 to 1124078 

SeqlDn°1269 LM-2607.1 From 1124079 to 1124462 



SeqlDn°1270 LM-2608.1 From 1124484 to 1125467 

SeqlDn°1271 LM-2609.1 From 1125482 to 1126495 

SeqID n° 1 272 LM-261 . 1 From 2798626 to 2799087 

SeqlDn°1273 LM-261 1.1 From 1126704 to 1128194 

SeqID n° 1274 LM-2613.1 From 1128206 to 1129030 



SeqID n° 1275 LM-2614.1 From 1129043 to 1129351 
SeqID n° 1276 LM-2615.1 From 1129412 to 1129816 



SeqID n° 1277 LM-2616.1 From 1129945 to 1131501 

SeqID n° 1278 LM-2618.1 From 1131558 to 1132760 

SeqID n° 1279 LM-2619.1 From 1133698 to 1134117 

SeqID n° 1280 LM-262. 1 From 2799108 to 2799551 

SeqID n° 1281 LM-2621.4 From 1134472 to 1136595 

SeqID n° 1282 LM-2623.2 From 2119833 to 2121308 

SeqID n° 1283 LM-2625.2 From 2121483 to 2123738 

SeqID n° 1284 LM-2626.2 From 2123735 to 2124097 

SeqID n° 1285 LM-2628.1 From 2124114 to 2125052 



unknown, similar to CDP- 
ribitol pyrophosphorylase 
unknown, similar to glucitol 

dehydrogenase 
unknown, similar to teichoic 
acid biosynthesis protein B 

precursor 
unknown, highly similar to 

glycerol-3-phosphate 
cytidylyltransferase (get), 
CDP-glycerol 
pyrophosphorylase 
(teichoic acid biosynthesis 
protein D) 
unknown, similar to 
glycosyltransferases 
unknown, siumilar to 
glysosyltransferases 
Unknown, similar to 
unknown proteins 
unknown, conserved 
hypothetical protein, similar 
to B. subtilis YueK protein 

unknown, similar to NH(3)- 
dependent NAD(+) 
synthetases, nitrogen 
regulatory protein 
unknown 
unknown, similar to PTS 
system, cellobiose-specific 

MB component (eel A) 
unknown, highly similar to 
GMP synthetase 
unknown, similar to 

integrases 
unknown, similar to a 
protein encoded by Tn916 
Unknown, similar to 
unknown proteins 
cadmium resistance protein 
Unknown, similar to UDP- 
N-acetylmuramoylalanyl-D- 
glutamate-2,6- 
diaminopimelate ligase 
Unknown, similar to 
penicillin-binding protein 2B 

Unknown, similar to cell- 
division protein FtsL 
Unknown, similar to 
unknown proteins 
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SeqID n° 


1286 


LM-2629.1 


From 2125065 


SeqID n 


1287 


LM-263.1 


From 2799609 


SeqID n" 


1288 


LM-2631.2 


From 2125699 


SeqID n° 


1289 


LM-2632.1 


From 2127062 


SeqID n° 
SeqID n° 


1290 
1291 


LM-2633.1 
LM-2635.1 


From 2128806 
From 2129185 


SeqID n° 


1292 


LM-2637.1 


From 2130492 


SeqID n° 


1293 


LM-2638.1 


From 2131158 


SeqID n° 


1294 


LM-2639.1 


From 2132412 


SeqID n° 


1295 


LM-264.1 


From 2800958 


SeqID n° 


1296 


LM-2641.1 


From 2134695 


SeqID n" 


1297 


LM-2642.1 


From 2135754 


SeqID n° 1298 


LM-2643.1 


From 2136239 


SeqID n° 


1299 


LM-2645.1 


From 2136893 


SeqID n° 


1300 


LM-2647.1 


From 2137205 


SeqID n° 


1301 


LM-2648.1 


From 2137702 


SeqID n° 


1302 


LM-265.1 


From 2801577 


SeqID n° 


1303 


LM-2650.2 


From 2138904 


SeqID n° 


1304 


LM-2651.2 


From 1994751 


SeqID n° 1305 


LM-2652.1 


From 1995520 


SeqID n° 


1306 


LM-2653.1 


From 1996204 


SeqID n° 


1307 


LM-2654.1 


From 1996784 


SeqID n° 


1308 


LM-2657.1 


From 1998168 


SeqID n° 


1309 


LM-2659.1 


From 1999468 



to 2125496 Unknown, similar to 
unknown proteins 
to 2800961 Unknown, conserved 
hypothetical proteins 
to 2126946 Unknown, similar to 

integral membrane proteins 
to 2128876 Unknown, similar to 

transporter binding proteins 
to 2129192 Unknown 
to 2130078 Unknown, weakly similar to 
ketopantoate reductase 
involved in thiamin 
biosynthesis 
to 2131028 Unknown, similar to 
unknown proteins 
to 2132330 Unknown, similar to 
unknown proteins 
to 2134652 Unknown, similar to 

excinuclease ABC (subunit 
A) 

to 2801407 Unknown, similar to 

transcription regulators 
to 2135735 Unknown, weakly similar to 

proteases 
to 2136236 Unknown, similar to 
phosphopantetheine 
adenylyltransferase 
to 2136796 Unknown, similar to 
unknown proteins 
to 2137174 Unknown, similar to 
unknown proteins 
to 2137654 Unknown, similar to 
unknown proteins 
to 2138757 Unknown, similar to 
unknown proteins 
to 2802458 Unknown, similar to 
unknown proteins 
to 2139809 unknown, highly similar to 
heme A famesyltransferase 
to 1 995434 Unknown, similar to 
unknown proteins 
to 19961 16 Unknown, similar to 
unknown proteins 
to 1 996749 Unknown, similar to 
unknown proteins 
to 1998037 Unknown, similar to 
unknown proteins 
to 1999454 Unknwon, similar to 5- 
enolpyruvylshikimate-3- 
phosphate synthase 
to 2000571 Unknown, similar to 

prephenate dehydrogenase 



WO 01/77335 



107 



PCT/FR01/01118 



SeqID n° 1310 LM-266.1 From 2802543 to 2802998 Unknown, similarto 

transcription regulator 
MerR family 

SeqID n° 131 1 LM-2660.1 From 2000592 to 2001674 Unknown, similar to 

histidinol-phosphate 
aminotransferase and 
tyrosine/phenylalanine 
aminotransferase 

SeqID n° 1312 LM-2661.1 From 2001674 to 2002048 Unknown, similar to 

chorismate mutase 

SeqID n° 1313 LM-2662.1 From 2002045 to 2003142 Unknown, similar to 3- 

dehydroquinate synthase 
SeqID n° 1314 LM-2664.2 From 2003145 to 2004311 Unknown, similar to 

chorismate synthase 

SeqID n° 1315 LM-267.1 From 2802998 to 2803402 Unknown, similar to 

unknown proteins 

SeqID n° 1316 LM-2675.4 From 1748263 to 1748709 unknown, similar to 

transcription regulators 
(Fur family), PerR in B. 
subtilis 

SeqID n° 1317 LM-2676.2 From 1356769 to 1357683 unknown, highly similar to 

tRNA pseudouridine 55 
synthase 

SeqID n° 1318 LM-2677.1 From 1356327 to 1356671 unknown, highly similar to 

ribosome-binding factor A 

SeqID n° 1319 LM-2679.1 From 1356032 to 1356310 unknown, conserved 

hypothetical protein similar 
to B. subtilis YlxP protein 

SeqID n° 1320 LM-268.1 From 2803445 to 2804056 unknown, similar to 

phosphatase 

SeqID n° 1321 LM-2683.1 From 1353696 to 1356035 unknown, highly similar to 

translation initiation factor 
IF-2 

SeqID n° 1322 LM-2684.1 From 1353374 to 1353673 unknown, conserved 

hypothetical protein, similar 
to B. subtilis YlxQ protein 

SeqID n° 1323 LM-2685.1 From 1353097 to 1353381 unknown, similarto B. 

subtilis YlxR protein 

SeqID n° 1324 LM-2687.1 From 1351964 to 1353082 unknown, highly similar to 

N utilization substance 
protein A (NusA protein) 
SeqID n° 1325 LM-2689.1 From 1351467 to 1351934 unknown, conserved 

hypothetical protein, similar 
to B. subtilis YlxS protein 

SeqID n° 1326 LM-269.1 From 2804070 to 2804438 unknown, similar to 

transcription regulator 
(RpiR family) 

SeqID n° 1327 LM-2692.1 From 1346951 to 1351285 unknown, highly similar to 

DNA polymerase III (alpha 
subunit) 
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SeqlDn°1328 LM-2693.1 From 1345140 to 1346846 
SeqlDn°1329 LM-2696.2 From 1343838 to 1345100 



SeqlDn°1330 LM-2698.2 From 1342682 to 1343824 



SeqlDn°1331 LM-2699.2 From 1341879 to 1342667 



SeqID n° 1332 LM-27.1 From 2730564 to 2731961 

SeqID n° 1333 LM-270.1 From 2804522 to 2805274 
SeqID n° 1334 LM-2701.3 From 1779714 to 1780802 



SeqID n° 1335 LM-2702.1 From 1780824 to 1781126 



SeqID n° 1336 LM-2703.1 From 1781196 to 1781504 



SeqID n° 1337 LM-2705.2 From 1781661 to 1784339 



SeqID n° 1338 LM-2707.2 From 1784473 to 1785798 

SeqID n° 1339 LM-2708.1 From 1785841 to 1786614 
SeqID n° 1340 LM-2710.1 From 1786607 to 1787287 



SeqID n° 1341 LM-2711.1 From 1787280 to 1787648 

SeqID n° 1342 LM-2712.1 From 1787769 to 1788752 

SeqID n° 1343 LM-2713.1 From 1788812 to 1789777 

SeqID n° 1344 LM-2714.1 From 1789825 to 1793085 

SeqID n° 1345 LM-2717.2 From 1793082 to 1795253 

SeqID n° 1346 LM-2718.2 From 1561978 to 1564242 



proIyl-tRNA synthetase 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YluC protein 

Unknown, similar to 
deoxyxylulose 5-phosphate 
reductoisomerase 
unknown, similar to 
phosphatidate 
cytidylyltransferase (CDP- 

diglyceride synthase) 
Unknown, similar to dGTP 
triphosphohydrolase 

unknown 
unknown, similar to 
putative outer surface 

protein 
unknown, similar to 
phosphotransferase 
system (PTS) lichenan- 
specific enzyme DA 

component 
unknown, similar to 
phosphotransferase 
system (PTS) lichenan- 
specific enzyme IIB 

component 
unknown, similar to 
transcriptional regulator 

(NifA/NtrC family) 
unknown, similar to ATP- 
dependent RNA helicases 

unknown 
ubknown, similar to ABC 
transporter, ATP-binding 
protein 
unknown, similar to 
transcriptional regulator 
(GntR family) 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
transcription regulators 
(Lad family) 
unknown, some similarities 
to cellobiose- 
phosphorylase 
unknown, similar to beta- 
glucosidases 
Unknown, similar to 
protein-export membrane 
protein SecDF 
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SeqID n° 1 347 LM-272. 1 From 2805440 to 2807398 

SeqID n° 1348 LM-2720.1 From 1564343 to 1564633 

SeqID n° 1349 LM-2721.1 From 1564776 to 1565105 

SeqID n° 1350 LM-2722.1 From 1565139 to 1566278 

SeqID n° 1351 LM-2723.1 From 1566365 to 1567393 

SeqID n° 1352 LM-2725.1 From 1567397 to 1568404 

SeqID n° 1353 LM-2726.1 From 1568420 to 1569025 

SeqID n° 1354 LM-2728.1 From 1569149 to 1570084 

SeqID n° 1355 LM-2729.1 From 1570154 to 1570879 

SeqID n° 1356 LM-273.1 From 2807459 to 2810107 

SeqID n° 1357 LM-2730.1 From 1570990 to 1571838 

SeqID n° 1358 LM-2732.1 From 1571905 to 1573194 

SeqID n° 1359 LM-2735.1 From 1573354 to 1574847 

SeqID n° 1360 LM-2736.2 From 1574922 to 1575740 

SeqID n° 1361 LM-2737.2 From 623251 to 624423 

SeqID n° 1362 LM-274.1 From 2810109 to 281 1791 

SeqID n° 1363 LM-27411 From 620805 to 623135 

SeqID n° 1364 LM-2743.3 From 618932 to 620380 

SeqID n° 1365 LM-2746.1 From 617660 to 618844 

SeqID n° 1366 LM-2747.1 From 616973 to 617629 

SeqID n° 1367 LM-275.1 From 281 1788 to 2812921 



Unknown, similar to PTS 
system, fructose-specific 
IIABC component 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to tRNA- 
guanine transglycosylase 
Tgt 

Unknown, similar to S- 
adenosylmethionine.tRNA 
ribosyltransferase- 
isomerase 
unknown, highly similar to 
Holliday junction DNA 
helicase RuvB 
unknown, highly similar to 
Holliday junction DNA 

helicase (ruvA) 
Unknown, similar to L- 
lactate dehydrogenase 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
sugar hydrolase 
Unknown, similar to 
prephenate dehydratase 
PheA 

Unknown, conserved GTP 
binding protein 
Unknown, similar to 

glycerol kinase 
Unknown, similar to 
glycerol uptake facilitator 

Unknown, conserved 
hypothetical membrane 
protein 
Unknown, similar to 
Sucrose phosphorylase 

Unknown, similar to 
preprotein translocase 
SecA subunit 
P60 extracellular protein, 
invasion associated protein 
lap 

Unknown, conserved 
hypothetical protein 
Unknown, weakly similar to 
carboxylesterase 
Unknown, conserved 
hypothetical protein 
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SeqlDn°1368 LM-2752.1 From 615825 to 616577 



SeqID n° 1369 LM-2753.1 From 615353 to 615811 
SeqlDn°1370 LM-2754.1 From 613836 to 615299 
SeqlDn°1371 LM-2755.2 From 612678 to 613406 



SeqlDn°1372 LM-2758.2 From 2343858 to 2345150 



SeqlDn°1373 LM-2759.1 From 2345260 
SeqlDn°1374 LM-276.1 From 2812925 



SeqlDn°1375 LM-2760.1 From 2345567 
SeqID n° 1376 LM-2761.1 From 2346252 
SeqlDn°1377 LM-2762.1 From 2346935 



SeqID n° 1378 
SeqID n° 1379 
SeqID n 6 1380 
SeqID n" 1381 
SeqID n° 1382 
SeqID n° 1383 
SeqID n° 1384 
SeqID n° 1385 



LM-2763.1 
LM-2764.1 
LM-2765.1 
LM-2766.1 
LM-2768.1 
LM-2769.1 
LM-2770.1 
LM-2771.1 



From 2347276 
From 2348231 
From 2348604 
From 2349475 
From 2350398 
From 2351384 
From 2351767 
From 2352616 



to 2345526 
to 2813881 



to 2346088 
to 2346596 
to 2347285 

to 2348151 
to 2348539 
to 2349356 
to 2350320 
to 2351300 
to 2351746 
to 2352615 
to 2356323 



SeqID n° 1386 LM-2772.3 From 2356325 to 2359798 



SeqID n° 1387 LM-2774.2 From 2097176 to 2099941 
SeqID n° 1388 LM-2776.1 From 2096075 to 2097064 

SeqID n° 1389 LM-2778.1 From 2095366 to 2096061 



Unknown, putative 
conserved membrane 
protein 
Unknown 
Unknown 
Unknown, similar to 
transcription regulator 

GntR family 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
transcriptional regulator 
(Lacl family) 
Unknown, similar to 
unknown proteins 
Unknown, hypothetical 
CDS 

unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 
IIA 

Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, similar to 

unkown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to ATP- 

dependent 
deoxyribonuclease (subunit 
A) 

Unknown, similar to ATP- 

dependent 
deoxyribonuclease (subunit 
B) 

isoleucyl-tRNA synthetase 
Unknown, similar to 

diaminopimelate epimerase 
Unknown, similar to 
unknown proteins 
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SeqID n° 


1390 


LM-278.1 


From 2813949 


to 2815283 


_ tr\ m.o 

SeqID n 


1391 


LM-2780.2 


From 2091698 


to 2094808 


SeqID n° 


1392 


LM-2781.3 


From 2504300 


to 2505157 


SeqID n° 


1393 


LM-2782.1 


From 2503931 


to 2504236 


SeqID n° 


1394 


LM-2784.1 


From 2502401 


to 2503804 


SeqID n° 


1395 


LM-2785.1 


From 2501603 


to 2502361 


SeqID n° 


1396 


LM-2788.1 


From 2500179 


to 2501150 


SeqID n°1397 


LM-2790 1 


From 2499169 


to 2500179 


SeqID n° 


1398 


LM-2792.1 


From 2498381 


to 2499172 


SeqID n° 


1399 


LM-2793 1 


From 2497069 

1 1 Ul 1 1 fctwf www 


to 24Q8?'3fi 


SeqID n° 


1400 


LM-2794.1 


From 2495818 


to 2496993 


SeqID n° 


1401 


LM-2795.2 


From 2495309 


to 2495662 


SeqID n° 


1402 


LM-2796.1 


From 1674961 


to 1676325 


SeqID n° 


1403 


LM-2797.1 


From 1674359 


to 1674964 


SeqID n° 


1404 


LM-2798.1 


From 1673368 


to 1674387 


SeqID n° 


1405 


LM-2799.1 


From 1672613 


to 1673371 


SeqID n° 


1406 


LM-28.1 


From 2731976 


to 2732455 


SeqID n° 


1407 


LM-280.1 


From 2815401 


to 2816090 



Unknown, conserved 
hypothetical protein similar 
to hypothetical hemolysin 
Unknown, similar to alpha- 
mannosidase 
Unknown, similar to 
transcription antiterminator 
Unknown, similar to B. 
subtilis YfhL protein 
unknown, highly similar to 
glutamate decarboxylases 
Unknown, similar to 
acetylesterase 
Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter fhuD precursor 
(ferrichrome-binding 

protein) 
Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter (permease) 
FhuG 

Unknown, similar to B. 
subtilis ferrichrome ABC 
transporter (ATP-binding 

protein) FhuC 
Unknown, similar to cell 
division proteins RodA, 
FtsW 

Unknown, similar to cell 
division proteins RodA, 
FtsW 

Unknown, conserved 
hypothetical proteins 
unknown, highly similar to 
anthranilate synthase alpha 

subunit 
unknown, highly similar to 
anthranilate synthase beta 

subunit 
unknown, highly similar to 

anthranilate 
phosphoribosyltransferase 
unknown, highly similar to 
indol-3-glycerol phosphate 
synthases 
Unknown, similar to 
spermidine/spermine N1- 
acetyl transferase 
unknown, similar to 
regulatory proteins of the 
SIR2 family 
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SeqID n° 


1408 


LM-2801.1 


From 1672008 


to 1672616 


SeqID n° 


1409 


LM-2804.1 


From 1670803 


to 1672005 


SeqID n 


A A a r\ 

1410 


LM-2805.1 


From 1670037 


to 1670810 


SeqID n° 
SeqID n° 


1411 
1412 


LM-2806.1 
LM-2808.1 


From 1669529 
From 1667831 


to 1669966 
to 1669444 


SeqID n° 


1413 


LM-2809.1 


From 1666107 


to 1667720 


SeqID n° 


1414 


LM-2810.1 


From 1665444 


to 1666097 


SeqID n° 


1415 


LM-2811.2 


From 1664575 


to 1665405 


SeqID n° 
SeqID n° 


1416 
1417 


LM-2812.2 
LM-2813.2 


From 49585 
From 49934 


to 49878 
to 50470 


SeqID n° 
SeqID n° 


1418 
1419 


LM-2816.1 
LM-2817.1 


From 50907 
From 51775 


to 51518 
to 52389 


SeqID n° 


1420 


1 Ail /"» A rt A 

LM-2818.1 


From 52630 


to 53925 


SeqID n° 


1421 


LM-2819.1 


From 53944 


to 54672 


SeqID n° 
SeqID n° 


1422 
1423 


LM-282.1 
LM-2820.1 


From 2816083 
From 54839 


to 2816373 
to 56812 


SeqID n° 
SeqID n 


1424 
1425 


LM-2822.1 
LM-2823.2 


From 56815 
From 57286 


to 57261 
to 58638 


SeqID n° 


1426 


LM-2826.1 


From 2580470 


to 2581780 


SeqID n° 


1427 


LM-2827.1 


From 2581900 


to 2583105 


SeqID n° 


1428 


LM-2829.1 


From 2583181 


to 2584065 


SeqID n° 


1429 


LM-283.1 


From 2816363 


to 2817571 


SeqID n° 


1430 


LM-2831.1 


From 2584055 


to 2584741 


SeqID n° 


1431 


LM-2832.1 


From 2585245 


to 2586108 



phosphoribosyl anthranilate 

isomerase 
unknown, highly similar to 
tryptophan synthase (beta 

subunit) 
unknown, highly similar to 
tryptophan synthase (alpha 
subunit) 
unknown 
Unknown, similar to 
putative transporters 
Unknown, similar to 
putative transporters 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
30S ribosomal protein S6 
unknown, highly similar to 
single-strand binding 
protein (SSB) 
unknown 
Unknown, similar to 
Staphylococcus AgrB 
protein 

Unknown, similar to sensor 
histidine kinase (AgrC from 
Staphylococcus) 
Unknwon, similar to 2- 
components response 
regulator protein (AgrA 
from Staphylococcus) 
Unknown 
Unknown, highly similar to 
B. subtilis YybT protein 
50S ribosomal protein L9 
unknown, highly similar to 
replicative DNA helicases 
Unknown, similar to cell 

wall binding proteins 
peptidoglycan lytic protein 
P45 

unknown, highly similar to 
cell-division protein FtsX 
Unknown, similar to drug- 
efflux transporters 
unknown, highly similar to 
the cell-division ATP- 
binding protein FtsE 
Unknown, similar to 
conserved hypothetical 
proteins 
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SeqID n° 1432 LM-2833.1 From 25861 14 to 2587097 



SeqlDn°1433 LM-2834.2 From 2587300 to 2589813 

SeqID n° 1434 LM-2835.2 From 1052509 to 1053429 
SeqID n° 1435 LM-2836.1 From 1051695 to 1052354 



SeqID n° 1436 LM-2838.1 From 1051038 to 1051676 

SeqID n° 1437 LM-2839.1 From 1049983 to 1051041 

SeqID n° 1438 LM-284.1 From 2817688 to 2818035 

SeqID n° 1439 LM-2840.1 From 1049273 to 1049986 

SeqID n° 1440 LM-2842.1 From 1048266 to 1049138 

SeqID n° 1441 LM-2843.1 From 1047490 to 1048185 

SeqID n° 1442 LM-2845.1 From 1046886 to 1047377 

SeqID n° 1443 LM-2846.1 From 1045869 to 1046771 

SeqID n° 1444 LM-2847.1 From 1045007 to 1045855 

SeqID n° 1445 LM-285.1 From 2818054 to 2818698 

SeqID n° 1446 LM-2851.1 From 1043821 to 1045014 

SeqID n° 1447 LM-2853.1 From 1042605 to 1043450 

SeqID n° 1448 LM-2854.1 From 1041446 to 1042561 

SeqID n° 1449 LM-2856.1 From 1040672 to 1041382 



unknown, highly similar to 
peptide chain release 
factor 2 
translocase binding subunit 
(ATPase) 
unknown 
unknown, similar to a 
bacterial K(+)-uptake 
system 
unknown, similar to two- 
component response 
regulator, in particular B. 

subtilis YvqC protein 
unknown, similar to two- 
component sensor histidine 
kinase in particular B. 
subtilis YvqE protein 

unknown 
unknown, similar to B. 
subtilis YvqF protein 
unknown, similar to B. 
subtilis YitL protein 
unknown, similar to E. coli 
copper homeostasis 

protein CutC 
unknown, similar to 
phosphotransferase 
system glucose-specific 

enzyme IIA 
unknown, highly similar to 
glycine betaine ABC 
transporters (glycine 
betaine-binding protein) 
unknown, highly similar to 

glycine betaine ABC 
transporters (permease) 
Unknown, similar to 
transaldolase 
unknown, highly similar to 

glycine betaine ABC 
transporter (ATP-binding 
protein) 
unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 

YkuT protein 
unknown, similar to N-acyl- 
L-amino acid 
amidohydrolases 
unknown, similar to 
tetrahydrodipicolinate 
succinylase 
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SeqlDn°1450 LM-2857.1 From 1039746 to 1040624 

SeqlDn°1451 LM-2858.1 From 1039297 to 1039749 

SeqlDn°1452 LM-2859.2 From 1038863 to 1039096 

SeqID n° 1453 LM-286. 1 From 2818841 to 2819530 

SeqID n° 1454 LM-2860.1 From 600837 to 601 148 

SeqlDn°1455 LM-2861.1 From601149 to601466 

SeqID n° 1456 LM-2863.1 From 601463 to 602218 

SeqID n° 1457 LM-2864.1 From 602208 to 602930 

SeqID n° 1458 LM-2866.1 From 602909 to 603535 

SeqID n° 1459 LM-2867.1 From 603536 to 604120 

SeqID n° 1460 LM-2868.1 From 604121 to 605404 

SeqID n° 1461 LM-287.1 From 2819860 to 2821587 

SeqID n° 1462 LM-2870.1 From 605401 to 606042 

SeqID n° 1463 LM-2871 .2 From 606039 to 607220 

SeqID n° 1464 LM-2872.2 From 1645958 to 1646413 

SeqID n° 1465 LM-2873.3 From 1646669 to 1647781 

SeqID n° 1466 LM-2875.1 From 1647821 to 1648366 

SeqID n° 1467 LM-2877.1 From 1648511 to 1649854 

SeqID n° 1468 LM-2878.2 From 1650149 to 1652500 

SeqID n° 1469 LM-288.1 From 2821641 to 2821892 

SeqID n° 1470 LM-2880.1 From 1652810 to 1653427 



unknown, similar to 
transcription regulator 

(LysR family), 
unknown, similar to B. 
subtilis YkuL protein 
unknown, similar to B. 
subtilis YkuJ protein 
Unknown, weakly similar to 
transcription regulators 
CRP/FNR family 
Unknown, similar to 
phosphorybosil-AMP- 
cyclohydrolase (Hisl2 

protein) 
Unknown, similar to 
phosphoribosyl-AMP 
cyclohydrolase (HisI1 
protein) 
unknown, highly similar to 

cyclase HisF 
unknown, highly similar to 
phosphoribosylformimino- 
5-aminoimidazole 
carboxamide ribotide 

isomerase 
unknown, similar to 
amidotransferases 
imidazoleglycerol- 
phosphate dehydratase 
unknown, highly similar to 
histidinol dehydrogenases 
Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ATP 
phosphoribosyltransferase 
histidyl-tRNA synthetase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
aminopeptidase 
Unknown, similar to 2-cys 

peroxiredoxin 
unknown, similar to UDP- 
N-acetyl muramate-alanine 

ligases 
Unknown, similar to DMA 
translocase 
Unknown 
Unknown, similar 
phenylalanyl-tRNA 
synthetase (beta subunit) 
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SeqID n° 1471 LM-2881 .1 From 1653433 to 1654233 Unknown, similar to 

unknown proteins 

SeqID n° 1472 LM-2882.1 From 1654268 to 1654579 Unknown, similar to 

thioredoxin 

SeqID n° 1473 LM-2883.1 From 1654902 to 1655975 Unknown, similar to 

aminopeptidase 

SeqID n° 1474 LM-2884.1 From 1656173 to 1656484 Unknown, similar to 

unknown proteins 

SeqID n° 1475 LM-2885.1 From 1656521 to 1656769 Unknown, similar to 

unknown proteins 

SeqID n° 1476 LM-2886.1 From 1656903 to 1657754 Unknown, similar to 

unknown proteins 

SeqID n° 1477 LM-2887.2 From 1657815 to 1658459 Unknown, similar to 

unknown proteins 

SeqID n° 1478 LM-2889.2 From 1658466 to 1659248 Unknown, similar to 

unknown proteins 

SeqID n° 1479 LM-289.1 From 2821912 to 2823195 seryl-trna synthetase 
SeqID n° 1480 LM-2890.2 From 1307009 to 1307605 unknown, conserved 

hypothetical protein, similar 
to B. subtilis YneS protein 

SeqID n° 1481 LM-2892.1 From 1306095 to 1306967 Unknown, similar to 

Lactococcus lactis LacX 
protein 

SeqID n° 1482 LM-2894.1 From 1305781 to 1306092 Unknown, similar to B. 

subtilis YneQ protein 

SeqID n° 1 483 LM-2895. 1 From 1 305390 to 1 305758 Unknown, similar to B. 

subtilis YneP protein 

SeqID n° 1484 LM-2898.1 From 1304457 to 1305236 unknown, highly similar to 

B. subtilis CodY protein 
SeqID n° 1485 LM-29.1 From 2732589 to 2733233 Unknwon, similar to 

ribulose-phosphate 3- 
epimerase 

SeqID n° 1486 LM-290.1 From 2823530 to 2823949 Unknown, similar to B. 

subtilis stress protein YdaG 

SeqID n° 1487 LM-2900.1 From 1303027 to 1304436 unknown, highly similar to 

ATP-dependent Clp 
protease-like proteins 

SeqID n° 1488 LM-2901.1 From 1302474 to 1303013 unknown, highly similar to 

beta-type subunit of the 
20S proteasome 

SeqID n° 1489 LM-2902.2 From 1301551 to 1302453 unknown, similar to 

integrase/recombinase 

SeqID n° 1490 LM-2903.2 From 1299964 to 1301268 unknown, similar glucose 

inhibited division protein A 

SeqID n° 1491 LM-2907.2 From 1297823 to 1299901 unknown, highly similar to 

DNA topoisomerase I TopA 
SeqID n° 1492 LM-2908.2 From 1296691 to 1297551 Unknown, similar to 

polypeptide deformylase, 
similar to B. subtilis Smf 
protein 
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SeqID n° 1493 


LM-2909.1 


From 


1295772 


to 1296557 


SeqID n° 1494 


LM-291.1 


From 


2824079 


to 2824651 


SeqID n° 1495 


LM-2910.1 


From 


1294912 


to 1295775 


SeqID n° 1496 


LM-2911.1 


From 


1294360 


to 1294902 


SeqID n° 1497 


LM-2912.2 


From 


1293689 


to 1294258 


SeqID n° 1498 


LM-2914.2 


From 


1498096 


to 1500252 


SeqID n° 1499 


LM-2915.1 


From 


1500268 


to 1501227 


SeqID n° 1500 


LM-2916.1 


From 


1501415 


to 1501861 


SeqID n° 1501 


LM-2917.1 


From 


1502252 


to 1503019 


SeqID n° 1502 


LM-2918.1 


From 


1503020 


to 1503964 



SeqID n° 1503 LM-292.1 From 2824648 to 2826354 



SeqID n° 1504 LM-2921.1 From 1504037 to 1505170 
SeqID n° 1505 LM-2922.1 From 1505312 to 1507153 



SeqID n° 1506 LM-2924.1 From 1507187 to 1507762 
SeqID n° 1507 LM-2925.1 From 1507804 to 1508841 



SeqID n° 1508 LM-2926.1 From 1508988 to 1510145 

SeqID n° 1509 LM-2928.1 From 1510234 to 1511262 

SeqID n° 1510 LM-2929.1 From 1511375 to 1511812 

SeqID n° 1511 LM-2931.3 From 1511858 to 1513684 

SeqID n° 1512 LM-2932.2 From 2535342 to 2536268 

SeqID n° 1513 LM-2933.1 From 2533840 to 2535183 

SeqID n° 1514 LM-2935.1 From 2532352 to 2533398 



unknown, similar to 
ribonuclease H mh 
Unknown, similar to 

glutamine 
amidotransferase 
unknown, conserved 
hypothetical protein similar 
to B. subtilis YlqF protein 
Unknown, similar to signal 

peptidase I 
Unknown, similar to signal 
peptidase I 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphate starvation 
induced protein PhoH 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
ribosomal protein L1 1 
methyltransferase 
Unknown, similar to para- 
aminobenzoate synthase 

component I 
heat shock protein DnaJ 
class I heat-shock protein 
(molecular chaperone) 
DnaK 

heat shock protein GrpE 
transcription repressor of 
class I heat-shock gene 
HrcA 

unknown, highly similar to 
coproporphyrinogen III 
oxidase * 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
transcriptional regulator 

(MerR family) 
unknown, highly similar to 
GTP-binding protein LepA 
Unknown, similar to 
dipeptidases 
RNA polymerase sigma-54 
factor (sigma-L) 
Unknown, similar to B. 
subtilis CggR hypothetical 
transcriptional regulator 



WO 01/77335 

SeqID n° 1515 

SeqID n° 1516 
SeqID n° 1517 

SeqID n° 1518 

SeqID n° 1519 

SeqID n° 1520 

SeqID n° 1521 
SeqID n° 1522 

SeqID n°1523 

SeqID n°1524 

SeqID n° 1525 
SeqID n° 1526 
SeqID n° 1527 

SeqID n° 1528 

SeqID n° 1529 
SeqID n" 1530 

SeqID n e 1531 

SeqID n° 1532 
SeqID n° 1533 
SeqID n° 1534 

SeqID n°1535 

SeqID n°1536 

SeqID n° 1537 
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LM-2936.1 From 2531310 to 2532320 unknov/n, highly similar to 

glyceraldehyde 3- 
phosphate dehydrogenase 
LM-2937.1 From 2529985 to 2531175 unknown, highly similar to 

phosphoglycerate kinase 
LM-2939.1 From 2529184 to 2529939 unknown, highly similar to 

those phosphate 
isomerase 

LM-294.1 From 2826515 to 2828236 Unknown, similar to ABC 

transporter, ATP-binding 
protein 

LM-2942.1 From 2527650 to 2529182 unknown, highly similar to 

phosphoglycerate mutase 
LM-2944.2 From 2526222 to 2527514 unknown, highly similar to 

6nol3S6 

LM-2945.1 From 2525946 to 2526119 Unknown 
LM-2946.2 From 2525094 to 2525813 Unknown, similar to 

lipolytic enzyme 

LM-2950.2 From 316873 to 318375 Unknown, similar to heat- 
shock protein htrA serine 
protease 

LM-2952.1 From 315945 to 316775 Unknown, conserved 

hypothetical protein similar 
to B. subtilis YycJ protein 
LM-2953.1 From 314984 to 315823 Unknown, similar to B. 

subtilis Yycl protein 
LM-2954.1 From 313659 to 314981 Unknown, similar to B. 

subtilis YycH protein 
LM-2957.1 From 311830 to 313662 Unknown, similar to two- 
component sensor histidine 
kinase 

LM-2958.1 From 310932 to 31 1645 Unknown, similar to two- 
component response 
regulator 

LM-2959.2 From 309538 to 31071 9 Unknown, similar to 

aminotransferase 

LM-296. 1 From 2828236 to 2830008 Unknown, similar to ABC 

transporter, ATP-binding 
protein 

LM-2960.3 From 2692097 to 2693041 unknown, highly similar to 

RNA polymerase (alpha 
subunit) 

LM-2962.2 From 26931 51 to 2693540 ribosomal protein S1 1 
LM-2964.2 From 2693563 to 2693928 ribosomal protein S1 3 
LM-2965.1 From 2694164 to 2694382 unknown, highly similar to 

initiation factor I F-l 
LM-2967.1 From 2694769 to 2695416 unknown, highly similar to 

adenylate kinases 
LM-2968.1 From 2695476 to 2696771 unknown, highly similar to 

preprotein translocase 
subunit 

LM-297. 1 From 2830284 to 2830523 Unknown 
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SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 



n° 1538 
n° 1539 
n°1540 
n° 1541 
n° 1542 
n° 1543 
n° 1544 
n°1545 



SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 
SeqID n° 



1546 
1547 
1548 
1549 
1550 
1551 



LM-2970.1 
LM-2972.1 
LM-2973.1 
LM-2974.1 
LM-2975.1 
LM-2977.1 
LM-2979.1 
LM-298.1 



LM-2981.1 
LM-2983.1 
LM-2984.1 
LM-2985.2 
LM-2986.2 
LM-2990.2 



From 2696771 
From 2697463 
From 2697988 
From 2698387 
From 2698954 
From 2699600 
From 2700166 
From 2830542 



From 2700515 
From 2701435 
From 2701872 
From 2702532 
From 2702909 
From 1593100 



to 2697211 
to 2697966 
to 2698347 
to 2698923 
to 2699352 
to 2700139 
to 2700477 
to 2831879 



to 2700883 
to 2701869 
to 2702528 
to 2702888 
to 2703187 
to 1594407 



SeqID n° 1552 LM-2991.1 From 1592167 to 1593096 



SeqID n° 1553 LM-2992.1 From 1591448 to 1592170 



SeqID n° 1554 LM-2993.1 From 1590477 to 1591451 



SeqID n° 1555 LM-2995.1 From 1589175 to 1590464 



SeqID n° 1556 LM-2997.1 From 1586201 to 1588852 

SeqID n° 1557 LM-2999.2 From 1584850 to 1586139 

SeqID n° 1558 LM-3.1 From 2712931 to 2713365 

SeqID n° 1559 LM-3000.1 From 1583928 to 1584638 



SeqID n° 1560 LM-3001.1 From 1583242 to 1583916 

SeqID n° 1561 LM-3004.1 From 1581777 to 1582790 

SeqID n° 1562 LM-3005.1 From 1580804 to 1581691 

SeqID n° 1563 LM-3006.1 From 1580283 to 1580801 



ribosomal protein L15 
ribosomal protein S5 
ribosomal protein L18 
ribosomal protein L6 
ribosomal protein S8 
ribosomal protein L5 . 
ribosomal protein L24 
Unknown, similar to D- 
alanyl-D-alanine 
carboxypeptidase 
(penicillin-binding protein 5) 
ribosomal protein L14 
ribosomal protein L16 
ribosomal protein S3 
ribosomal protein L22 
ribosomal protein S19 
unknown, highly similar to 
glutamyl-tRNA reductase 
unknown, highly similar to 
porphobilinogen 
deaminases 
(hydroxymethylbilane 

synthase) 
Unknown, similar to 
uroporphyrinogen III 
cosynthase (HemD) 
unknown, highly similar to 
delta-aminolevulinic acid 

dehydratases 
(porphobilinogen synthase) 
unknown, highly similar to 
glutamate-1-semialdehyde 
2, 1-aminotransfe rases 
valyWRNA synthetase 
unknown, similar to Folyl- 
polyglutamate synthetase 
Unknown 
Unknown, similar to B. 
subtil is late competence 
protein ComC (type IV 
prepilin peptidase) 
Unknown, similar to DNA 

repair protein RadC 
unknown, similar to cell- 
shape determining protein 
MreB 

unkown, similar to cell- 
shape determining protein 
MreC 

unknown, similar to cell- 
shape determining protein 
MreD 
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SeqID 


n° 


1564 


LM-3007.2 


From 


1579428 


SeqID 


n° 


1565 


LM-301.1 


From 


2832058 


SeqID 


n° 


1566 


LM-3010.1 


From 


1149406 


SeqID 


n° 


1567 


LM-3011.1 


From 


1149011 


SeqID 


n° 


1568 


LM-3012.1 


From 


1147603 


SeqID 


n° 


1569 


LM-3013.1 


From 


1146228 


SeqID 


n° 


1570 


LM-3014.1 


From 


1145941 


SeqID 


n° 


1571 


LM-3016.1 


From 


1145211 


SeqID 


n° 


1572 


LM-3017.1 


From 


1144533 


SeqID 


n° 


1573 


LM-3018.1 


From 


1142099 




n° 
1 1 






null! 


1 Iwww*tVJ 


SeqID 


n° 


1575 


LM-3022.1 


From 


1138930 


SeqID 


n° 


1576 


LM-3023.1 


From 


1137997 


SeqID 


n' 


1577 


LM-3024.1 


From 


1137533 


SeqID 


n° 


1578 


LM-3025.3 


From 


1254459 


SeqID 


n° 


1579 


LM-3027.3 


From 


1253379 


SeqID 


n° 


1580 


LM-3028.1 


From 


1252667 


SeqID 


n° 


1581 


LM-3029 1 


From 


1249360 


SeqID 


n° 


1582 


LM-3030.1 


From 


1248892 


SeqID 


n° 


1583 


LM-3031.2 


From 


1245384 


SeqID 


n° 


1584 


LM-3032.2 


From 


1244669 


SeqID 


n° 


1585 


LM-3035.2 


From 


1608166 



PCT/FR01/01118 



to 


1580105 

1 WW 1 WW 


unknown similar to cell- 

V4I ll\l IWVVI l| «9llllllWl IW vvl 1 






division inhibition fseotum 






placement) Drotein MinC 


to 


2833725 


I Inknnwn similar to 






acvlase and diesterasp 

MWJlMWV Ul IV4 VflWWkWlUWW 


to 


1149720 


unknown hiahlv similar to 






TN916 ORF23 

1 I1W IW V-X I \ 1 £m\J 


to 


1149385 


unknown hiohlv similar tn 






TN916 0RF22 


to 


1149003 


unknown hiohlv similar to 

\ji irvi iwvvi i, i iimj u y otiimcii wj 






TN916 0RF21 


to 


1147412 


unknown hiahlv similar to 

mi 1 1 vl iv in 1 1 1 II Ml H J wllllllwll IW 






TN916ORF20 


to 


1146231 


unknown similar to 

\A 1 MM 1 v If 1 1 j w II I III I Lw 






unknown proteins 


to 


1145711 


unknown hiahlv similar to 

ui ir\i iwvvi i| iiiytiiy oniiiicii iu 






TN916 0RF18 


to 


1144928 


unknown hiohlv similar tn 
ui iim iwvvi i, i uyi 1 1 jf on i Midi i\j 






I INC/ IU V^rxr I I 


in 
IU 




unknown, nigniy similar to 






1 INt7 IO VJrxr IO 


to 


I l*t^CUs?57 


uniMiuwri, niyniy oiiiiitar to 






I INw I w Vw/IA 1 l%? 


to 


i iowwhu 


unKnown, nigniy similar to 






TNQ16 ORF14 and to I 

l INw I w \j r\ i l *r a) iU WJ 






1 1 iui iwL»y tuy ci led rou 






orotpin 


to 


1138914 

1 1 WW w | "T 


unknown hiahlv similar to 

mi irvi iwvvi i| i nyj \\y oijj mcii iw 






TN916 0RF13 


to 


1137868 


unknown similar to 

|l\l IV f f 1 l| Mil 1 IllVtt K\*S 






cadmium efflux system 






accessory proteins 


to 


1254722 


unknown similar to B 

\A 1 1 i VI IWVVI l| wll 1 IIIWI VW t— 






subtilis YshA protein 


to 


1254311 


unknown similar to B 

WI ll\l IVTII l| Wll 1 IIIUI IV LmJ m 






subtilis ribonuclease HIM 

WWWUIIW 1 IMVI lUWlwUtfw 1 llll 


to 


1253341 


Unknown similar to uracil- 

V_/ | | l\l IvTrl 1 1 wll 1 III Wtl K\y U| Q vll 






DNA alvcosvlase 


to 


1252560 

1 fc.WA.WWW 


unknown* similar to 

Ul li\i iwvvi || oil I iiiui lv 






transDorter fto B subtilis 






YdgH protein) 


to 


1249344 


unknown; similar to 






transcriptional regulator 






(MarR family). 


to 


1248794 


unknown, similar to 






different proteins 


to 


1245370 


unknown, similar to ABC 






transporter, ATP-binding 






proteins 


to 


1609923 


unknown, highly similar to 






pyruvate kinases 
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SeqID n° 


1586 


LM-3036.1 


From 1607671 


to 1608048 


SeqID n° 


1587 


LM-3037.1 


From 1607053 


to 1607514 


SeqID n° 


1588 


LM-3038.1 


From 1605895 


to 1607016 


SeqID n° 


1589 


LM-3039.1 


From 1604613 


to 1605875 


SeqID n° 


1590 


LM-304.1 


From 2833834 


to 2835987 


SeqID n° 
SeqID n° 


1591 
1592 


LM-3041.1 
LM-3042.1 


From 1601820 
From 1600975 


to 1604447 
to 1601796 


SeqID n° 


1593 


LM-3043.2 


From 1600356 


to 1600958 


SeqID n° 


1594 


LM-3044.2 


From 1875574 


to 1876560 


SeqID n° 


1595 


LM-3049.1 


From 1876575 


to 1880135 


SealD n° 


1596 


LM-3050 1 


From 1880158 


to 1880847 


SeqID n° 


1597 


LM-3053.1 


From 1881381 


to 1882124 


SeqID n° 


1598 


LM-3054.1 


From 1882128 


to 1883069 


SeqID n° 


1599 


LM-3056.1 


From 1883062 


to 1884075 


SeqID n° 


1600 


LM-3058.2 


From 1884096 


to 1884665 


SeqID n° 


1601 


LM-3059.2 


From 817914 


to 818762 


SeqID n° 
SeqID n° 


1602 
1603 


LM-3062.1 
LM-3065.3 


From 813451 
From 811813 


to 817878 
to 813189 


SeqID n° 
SeqID n° 
SeqID n° 


1604 
1605 
1606 


LM-3066.2 
LM-3067.1 
LM-3068.1 


From 2159224 
From 2157949 
From 2157383 


to 2159604 
to 2159094 
to 2157844 


SeqID n° 


1607 


LM-3069.1 


From 2156694 


to 2157386 


SeqID n° 


1608 


LM-307.1 


From 2836011 


to 2837783 
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Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
unknown, highly similar to 
citrate synthase subunit II 
unknown, highly similar to 
isocitrate dehyrogenases 
Unknown, similar to DNA 
topoisomerase III 

DNA polymerase I 
unknown, highly similar to 
formamidopyrimidine-DNA 
glycosylases 
Unknown, similar to 
unknown proteins 
Unknown, similar to FtsY of 
E. coli and SRP receptor 

alpha-subunit 
Unknown, similar to Smc 

protein essential for 
chromosome condensation 
and partition 
. Unknown, similar to 

ribonuclease III 
Unknown, similar to 3- 
ketoacyl-acyl carrier protein 
reductase 
Unknown, similar to 
malonyl CoA-acyl carrier 

protein transacylase 
Unknown, similar to pIsX 
protein involved in fatty 
acid/phospholipid synthesis 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown 
Unknown, similar to amino 
acid transporter 
Unknown 
unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
glycoprotease 
Unknown, similar to ATP- 
dependent DNA helicases 
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SeqID n° 1609 


LM-3070.1 


From 2156242 


to 2156697 


SeqID n° 1610 


LM-3072.1 


From 2155223 


to 2156245 


SeqID n° 1611 


LM-3073.1 


From 2153720 


to 2154679 


SeqID n° 1612 


LM-3074.1 


From 2151744 


to 2153696 


SeqID n° 1613 


LM-3075.1 


From 2150786 


to 2151433 


SeqID n° 1614 


LM-3076.2 


From 2149868 


to 2150554 


SeqID n° 1615 


LM-3077.2 


From 992978 


to 993886 


SeqID n° 1616 


LM-3080.1 


From 993900 


to 995126 


SeqID n" 1617 


LM-3081.1 


From 995210 


to 995767 


SeqID n° 1618 


LM-3083.1 


From 995790 


to 996704 


SeqID n° 1619 


LM-3084.1 


From 996739 


to 997557 


SeqID n° 1620 


LM-3085.1 


From 997795 


to 998379 


SeqID n° 1621 
SeqID n° 1622 


LM-3086.2 
LM-3087.2 


From 998396 
From 999022 


to 998863 
to 999690 


SeqID n° 1623 


LM-3088.1 


From 999722 


to 1000516 


SeqID n° 1624 


LM-3089.1 


From 1000535 


to 1001425 


SeqID n° 1625 


LM-309.2 


From 2837944 


to 2839410 


SeqID n° 1626 


LM-3090.1 


From 1001503 


to 1002291 


SeqID n° 1627 


LM-3091.1 


From 1002319 


to 1003593 



Unknown, similar to 
ribosomal protein alanine 
acety transferase 
Unknown, similar to 
glycoprotein endopeptidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to a 
putative DNA binding 

proteins 
Unknown, similar to 
unknown proteins 
unknown, similar to 

proteases 
unknown, similar to 
proteases 
unknown, Listeria epitope 
LemA 
unknown, similar to 
putative heat shock protein 
HtpX, Listeria epitope 

LemB 
unknown, similar to B. 
subtilis YjbH protein 
unknown, similar to B. 
subtilis YjbK protein 

unknown 
unknown, similar to B. 
subtilis YjbM protein 
unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 
YjbN protein 
unknown, similar to 
ribosomal large subunit 
pseudouridine synthetase 

Unknown, similar to 
inosine-monophosphate 

dehydrogenase 
unknown, similar to enoyl- 
acyl-carrier protein 
reductase 
DltD protein for D-alanine 
esterification of lipoteichoic 
acid and wall teichoic acid 
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SeqlD n° 1628 LM-3092.1 From 1003855 to 1005039 

SeqID n° 1629 LM-3093.2 From 1005036 to 1006568 

SeqlD n° 1630 LM-3095.2 From 1364031 to 1364570 

SeqID n° 1631 LM-3097.1 From 1364669 to 1366207 

SeqlD n° 1632 LM-3098.1 From 1366457 to 1367425 

SeqID n° 1633 LM-3099.1 From 1367559 to 1368650 

SeqID n° 1634 LM-3100.1 From 1368668 to 1368985 

SeqID n° 1635 LM-3101.1 From 1368982 to 1369449 

SeqID n° 1636 LM-3102.1 From 1369412 to 1369696 

SeqID n° 1637 LM-3103.1 From 1369683 to 1370111 

SeqID n° 1638 LM-3104.1 From 1370108 to 1370431 

SeqID n° 1639 LM-3105.1 From 1370445 to 1371476 

SeqID n° 1640 LM-3106.1 From 1371454 to 1372476 

SeqID n° 1641 LM-3107.1 From 1373015 to 1374103 

SeqID n° 1642 LM-3110.3 From 1374119 to 1375465 

SeqID n° 1643 LM-3111.1 From 1375462 to 1376928 



SeqID n° 1644 LM-3112.1 From 1376968 to 1377348 

SeqID n° 1645 LM-3113.1 From 1377410 to 1377682 

SeqID n° 1646 LM-3114.2 From 1377695 to 1378660 

SeqID n° 1647 LM-31 15.3 From 1962698 to 1963243 



DltB protein for D-alanine 
esterification of lipoteichoic 
acid and wall teichoic acid 

D-alanine-activating 
enzyme (dae), D-alanine- 
D-alanyl carrier protein 

ligase (del) 
unknown, similar to 5- 
formyltetrahydrofolate 

cyclo-ligase 
unknown similar to B. 

subtilis yqgP 
Unknown, similar to 
glucose kinase 
unknown, similar to B. 
subtilis YqgU protein 
Unknown, weakly similar to 
B. subtilis comG operon 

protein 7 (comGG) 
Unknown, similar to B. 
subtilis comG operon 
protein 6 
Unknown, similar to comG 
operon protein 5 (comGE) 
Unknown, similar to comG 
operon protein 4 (comGD) 
Unknown, similar to B. 
subtilis comG operon 

protein 3 
Unknown, similar to B. 
subtilis comG operon 

protein 2 
Unknown, similar to B. 
subtilis comG operon 

protein 1 
Unknown, similar to 
aminomethyltransferase 
Unknown, similar to glycine 

dehydrogenase 
(decarboxylating) subunit 1 
Unknown, similar to glycine 

dehydrogenase 
(decarboxylating) subunit 2 
unknown 
Unknown 
Unknown, similar to B. 
subtilis YqhQ protein 
unknown, similar to 
conserved hypothetical 
proteins 
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SeqlDn°1648 LM-3116.2 From 1962225 to 1962566 

SeqlDn°1649 LM-3119.1 From 1960496 to 1961644 

SeqlDn°1650 LM-312.1 From 130652 to 131380 

SeqlDn°1651 LM-3120.1 From 1958968 to 1960476 

SeqlDn°1652 LM-3121.1 From 1958162 to 1958740 

SeqlDn°1653 LM-3124.1 From 1956848 to 1958155 

SeqlDn°1654 LM-3126.1 From 1955598 to 1956656 

SeqlDn°1655 LM-3127.2 From 1955205 to 1955474 

SeqlDn°1656 LM-3128.3 From 1954255 to 1955127 

SeqID n" 1657 LM-3129.1 From 248555 to 249013 



SeqID n° 1658 


LM-313.1 


From 130249 


to 130671 


SeqID n° 1659 


LM-3130.1 


From 249026 


to 249544 


SeqID n° 1660 


LM-3131.1 


From 249541 


to 250563 


SeqID n° 1661 


LM-3135.1 


From 250592 


to 253054 


SeqID n° 1662 


LM-3136.2 


From 253200 


to 254573 


SeqID r>° 1663 


LM-3139.2 


From 254707 


to 255780 


SeqID n° 1664 


LM-314.1 


From 129694 


to 130230 


SeqID n° 1665 


LM-3140.2 


From 255800 


to 256498 


SeqID n° 1666 
SeqID n° 1667 


LM-3141.2 
LM-3142.2 


From 348586 
From 349326 


to 349065 
to 350228 


SeqID n° 1668 


LM-3143.1 


From 350392 


to 351264 


SeqID n° 1669 


LM-3147.1 


From 355643 


to 356323 



unknown, similar to 
hypothetical proteins 
unknown, similar to 
conserved hypothetical 
proteins 
Unknwon, similar to 
autolysin: N- 
acetylmuramoyl-L-alanine 
amidase 
unknown, similar to 
probable thermostable 
carboxypeptidases 
unknown, similar to 
xanthine 
phosphoribosyltransferase 
unknown, similar to 
probable permeases 
unknown, similar to 

chitinases 
unknown, similar to 
ribosomal protein S14 
unknown, similar to 5 -3 

exonuclease 
Unknown, highly similar to 
transcription repressor of 
class III stress genes 

(CtsR) 
Unknwon, similar to a 
protein from Bacteriophage 
phi-105 (ORF 45) 
Unknown, similar to B. 
subtilis YacH protein 
Unknwon, similar to 
arginine kinase 
endopeptidase Clp ATP- 

binding chain C 
unknown, similar to DNA 

repair protein Sms 
Unknown, highly similar to 

B. subtilis YacL protein 
Unknwon, weakly similar to 
protein gp20 from 
Bacteriophage A1 18 
Unknown, similar to 
nucleotidyl transferase; 
pyrophosphorylase 

Unknown 
Unknown, similar to 
transcriptional regulators 

Unknown, similar to 
transcriptional regulators 
Unknown 



WO 01/77335 



PCT/FR01/01118 



124 



SeqID n° 1670 LM-3149.2 From 2602249 to 2602683 



SeqID n° 1671 LM-315.1 From 129218 to 129697 
SeqID n° 1672 LM-3150.1 From 2601632 to 2601997 

SeqID n° 1673 LM-3151.1 From 2600408 to 2601241 



SeqID n° 1674 LM-3152.1 From 2599548 to 2600282 



SeqID n° 1675 LM-3153.1 From 2598423 to 2599547 



SeqID n° 1676 LM-3154.1 From 2597363 to 2598415 



SeqID n° 1677 LM-3155.1 From 2596266 to 2597324 



SeqID n° 1678 LM-31 56.1 From 2595485 to 2596090 
SeqID n° 1 679 LM-31 57.1 From 2594908 to 2595543 



SeqID n° 1 680 LM-31 58. 1 From 2593837 to 2594523 



SeqID n° 1681 LM-31 59.1 From 2592965 to 2593816 

SeqID n° 1682 LM-316.1 From 128628 to 129203 

SeqID n° 1683 LM-3162.1 From 2591451 to 2592770 

SeqID n° 1684 LM-3163.2 From 2590802 to 2591458 



SeqID n° 1685 LM-3164.2 From 2590044 to 2590607 

SeqID n° 1686 LM-3167.1 From 2020738 to 2021256 
SeqID n° 1687 LM-3168.1 From 2021272 to 2023062 



Unknwon, similar to 
hydroxymyristoyl-(acyl 
carrier protein) 
dehydratase 
Unknwon 
Unknown, similar to single- 
strand DNA-binding protein 
Unknwon, similar to 
hypothetical cell wall 
binding protein from B. 

subtilis 
Unknown, similar to B. 
subtilis TagA protein 
involved in polyglycerol 
phosphate biosynthesis 
Unknwon, similar to B. 

subtilis O- 
succinylbenzoate-CoA 

synthase (MenC) 
Unknown, similar to B. 
subtilis TagO teichoic acid 
linkage unit synthesis 

protein 
Unknown, similar to B. 

subtilis putative 
transcriptional regulator 
LytR 
unknwon 
Unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to B. 
subtilis two-component 
response regulator DegU 

Unknown, similar to B. 
subtilis YviA (DegV) protein 
unknown 
unknown, similar to late 
competence protein comFA 
unknown, similar to late 
competence protein 

comFC 
Unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 

YvyD protein 
Unknown, similar to similar 
to acyl-CoA hydrolase 
Unknown, similar to two- 
component sensor histidine 
kinase (ResE) 
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ouLjiu n 


IDOO 




Emm 909^1 M 
rlOiTl c\j£.0 I DO 


tn 9090P7Q 


unknown, similar to two- 
component response 
regulator (ResD) 




1RRQ 


Llvl-v> I / . 1 


From 

r i wi 1 1 i coo i *t 


tn 19ftft1** 
IU I *iOO I O 


unKnown 


SeqID n° 


1690 


LM-3171.1 


From 2024062 


to 2024796 


Unknown, similar to 

unWnnwn nmtpinc 


SeqID n° 


1691 


LM-3172.1 


From 2024799 


to 2025395 


Unknown, similar to 

i inknovwn nmfoinc 

UI liM IUWI 1 pi Uldl Io 


SeqID n° 


1692 


LM-3174.1 


From 2025392 


to 2026141 


Unknown, similar to 

iinknnwn nmtoinc 
UllfMIUWI) piULciiio 


SeqID n° 


1693 


LM-3177.1 


From 2026157 


to 2027467 


Unknown, similar to 
diaminopimelate 

UCU3I UL/Ayidoti 


SeqID n° 


1694 


LM-3178.1 


From 2027648 


to 2028466 


Unknown, similar to purine- 
nucleoside phosphorylase 


oeqiu n 


ioyo 


I KA Oi7Q 9 


rrom /U/O4O0 


to /uzyooy 


Unknown, similar to 
* phosphopentomutase 


oeqiu n 


IOcJO 


Llvl-O lot./ 


rrom /u/yoyo 


io zuouoy i 


Unknown, similar to 
integrase/recombinase 


SeqID n° 


1697 


LM-3 184.2 


From 635317 


to 636423 


Unknown, similar to 
nomosenne \j- 
acetyltransferase 


SeqID n° 


1698 


LM-3 186.1 


From 636440 


to 637717 


Unknown, similar to O- 
acetylhomoserine 
sulfhydrylase 


oeqiu n 


noyy 


LM-ol.OO.T 


rrom oooioo 


to 000092 


Unknown, similar to 
unknown proteins 


SeqID n° 


1700 


LM-3189.1 


From 638781 


to 639482 


Unknown, similar to 
transcription regulator 
UKr/riNK tamiiy 


SeqID n° 


1701 


LM-319.1 


From 127188 


to 128324 


Unknown, similar to protein 
gp18from Bacteriophage 

A A A O 

A118 


SeqID n° 


1702 


LM-3190.2 


From 639558 


to 640106 


Unknown, similar to 
piumiiio uivoivcJu in Dioun 












rneiaDonsm ^dioyj 




I M-^iQI 9 
LIVI-O 1 \7 1 »Z 


rruiii I do to id 




unKnown, similar io two- 
component response 

rani ilatnrc 


SeqID n° 


1704 


LM-3192.1 


From 1538498 


to 1539937 


Unknown, similar to two- 
component sensor histidine 
kinase 


oeqiu n 


I (KJD 


Ltvro iy*t. i 


Frnm 1^QQ7Q 

rrom looyy/y 


tn i*v4907^ 


unknown, similar to 
exodeoxyribonuclease V 


SeqID n° 


1706 


LM-3195.1 


From 1542403 


to 1543041 


Unknown, similar to 










unknown proteins 


SeqID n° 


1707 


LM-3196.1 


From 1543082 


to 1543822 


Unknown, similar to 
unknown proteins 


SeqID n° 


1708 


LM-3198.1 


From 1543940 


to 1545055 


Unknown, similar to 
putative tRNA (5- 
methylaminomethyl-2- 
thiouridylate)- 
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SeqID n° 1 709 LM-32. 1 From 2733230 to 2735245 

SeqlDn°1710 LM-3200.2 From 1545074 to 1546222 

SeqID n° 1711 LM-3201.2 From 1527422 to 1527904 

SeqID n° 1712 LM-3203.2 From 1526748 to 1527371 

SeqID n° 1713 LM-3204.1 From 1525997 to 1526698 

SeqID n° 1714 LM-3205.1 From 1524154 to 1525962 

SeqID n° 1715 LM-3206.2 From 1523471 to 1523992 

SeqID n° 1716 LM-3208.2 From 1522374 to 1523474 

SeqID n° 1717 LM-3209.1 From 1521533 to 1522354 

SeqID n° 1718 LM-321.1 From 126360 to 127178 

SeqID n° 1719 LM-3210.1 From 1521243 to 1521533 

SeqID n° 1720 LM-3211.1 From 1520659 to 1521225 

SeqID n° 1721 LM-3212.1 From 1520097 to 1520672 

SeqID n° 1722 LM-3213.1 From 1519736 to 1520092 

SeqID n° 1723 LM-3214.1 From 1518950 to 1519681 

SeqID n° 1724 LM-3215.1 From 1518245 to 1518847 

SeqID n° 1725 LM-3216.1 From 1517613 to 1518173 

SeqID n° 1726 LM-3217.2 From 1515356 to 1517578 



SeqID n° 1727 LM-3218.2 From 1418803 to 1420095 



methyltransferase 



Unknown, similar to 
transketolase 
Unknown, simiiarto iron- 
sulfur cofactor synthesis 
protein 
Unknown, similar to 
transcription elongation 
factor GreA 
Unknown, similar to 
unknown proteins 
Unknown, similar to 5 - 
methylthioadenosine/S- 
adenosylhomocysteine 
nucleosidase 
Unknown, similar to 

oligopeptidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
shikimate 5- 
dehydrogenase (AroD) 
Unknown, similar to phage 
proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, simiiarto 
unknown proteins 
Unknown, similar to 
integral membrane protein 
ComEA 
Unknown, simiiarto B. 
subtilis ComEB protein 
Unknown, similar to 
putative integral membrane 
protein ComEC specifically 
required for DNA uptake 
but not for binding 
Unknown, similar to 
putative proteases 
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SeqlDn°1728 LM-3220.1 From 1417735 

SeqlDn°1729 LM-3221.1 From 1416686 

SeqlDn°1730 LM-3223.1 From 1415152 

SeqiDn°1731 LM-3226.3 From 1413646 

SeqlDn°1732 LM-3228.3 From 1412455 

SeqlDn°1733 LM-3231.1 From 1410148 

SeqlDn°1734 LM-3234.2 From 843311 

SeqID n° 1735 LM-3235.1 From 842781 

SeqIDn°1736 LM-3237.1 From 842313 

SeqID n° 1737 LM-3239.1 From 841 157 

SeqID n° 1738 LM-324.1 From 124495 

SeqID n° 1739 LM-3241 .1 From 840200 

SeqID n° 1740 LM-3243.3 From 839520 

SeqID n° 1741 LM-3244.3 From 838751 

SeqID n° 1742 LM-3246,1 From 837548 

SeqID n° 1743 LM-3248.1 From 836745 

SeqID n° 1744 LM-3249.1 From 835939 

SeqID n° 1745 LM-3250.1 From 834845 

SeqID n° 1746 LM-3251.2 From 834287 



to 1 41 8685 Unknown, similar to sugar 
ABC transporter, permease 
protein 

to 1417738 Unknown, similar to 
permease proteins 
to 1416693 Unknown, similar to sugar 
ABC transporter, ATP- 
binding protein 
to 1414719 Unknown, CD4+ T cell- 
stimulating antigen, 
lipoprotein 
to 1413294 Unknown, similar to 
pyrroline-5-carboxyIate 
reductase 

to 1412421 Unknown, similar to DNA 

translocase 
to 843724 Unknown, similar to E. cbli 

PhnB protein 
to 843287 Unknown, similar to B. 

subtilis regulatory protein 
PaiA 

to 842765 unknown, similar to 

transcription regulators 
to 842086 unknown, similar to 

oxidoreductases 
to 126363 unknown, similar to 
bacteriophage minor tail 
proteins 

to 841 072 unknown, similar to 

fructokinases 
to 840062 unknown 
to 839452 unknown, similar to 
carbonic anhydrase 
to 838621 Unknown, similar to 
spermidine/putrescine- 
binding protein 
to 837551 unknown, similar to 

spermidine/putrescine ABC 
transporter, permease 
• protein 

to 836748 unknown, similar to 

spermidine/putrescine ABC 
transporter, permease 
protein 

to 835939 unknown, similar to 

spermidine/putrescine ABC 
transporter, ATP-binding 
protein 

to 834829 unknown, similar to 

transcription regulator ■ 
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SeqlDn°1747 LM-3252.2 From 1724612 to 1725565 
SeqID n° 1748 LM-3254.1 From 1725584 to 1726993 
SeqID n° 1749 LM-3255.1 From 1727009 to 1727827 



SeqID n 1750 


LM-3256.1 


From 1727824 


to 1728651 


SeqID n 1751 


LM-3258.1 


From 1728653 


to 1730395 


SeqID n° 1752 
SeqID n° 1753 


LM-326.1 
LM-3260.2 


From 124104 
From 1730392 


to 124508 
to 1731780 


SeqID n° 1754 


LM-3261.2 


From 2308741 


to 231 1464 


SeqID n 1755 


LM-3262.1 


From 2311461 


to 2312696 


SeqID n 1756 


LM-3263.1 


From 2312839 


to 2313192 


SeqID n 1757 


LM-3265.1 


From 2313274 


to 2314407 


SeqID n° 1758 


LM-3266.1 


From 2314511 


to 2315878 


SeqID n° 1759 


LM-3267.1 


From 2315948 


to 2316742 


SeqID n° 1760 


LM-3268.1 


From 2316739 


to 2317644 


SeqID n° 1761 


LM-3269.2 


From 2317950 


to 2320094 


SeqID n° 1762 


LM-3274.2 


From 558526 


to 560187 


SeqID n° 1763 


LM-3275.1 


From 558037 


to 558480 


SeqID n° 1764 


LM-3276.1 


From 557178 


to 557930 


SeqID n° 1765 


LM-3278.1 


From 555660 


to 556976 


SeqID n° 1766 
SeqID n° 1767 


LM-328.1 
LM-3281.1 


From 123760 
From 554624 


to 124062 
to 555628 


SeqID n° 1768 


LM-3283.2 


From 553078 


to 554493 



unknown, similar to ABC 
transporter and adhesion 
proteins 
similar to O- 
succinylbenzoic acid-CoA 
ligase 
unknown, similar to 
dihydroxynapthoic acid 

synthetase 
unknown, similar to prolyl 
aminopetidases 
unknown, similar to 2- 
succinyl-6-hydroxy-2,4- 

cyclohexadiene-1- 
carboxylate synthase / 2- 
oxoglutarate decarboxylase 
unknown 
unknown, similar to 
menaquinone-specific 
isochorismate synthase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
fumarate hydratase 
Unknown, similar to 
unknown proteins 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
penicillin-binding protein 

Unknown, similar to 
putative sulfate transporter 
Unknown, similar to B. 
subtilis YybC protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 6- 
phospho-beta-glucosidase 
Unknwon 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
multidrug resistance 
protein 
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SeqlDn° 1769 LM-3284.1 

SeqID n° 1770 LM-3285.1 

SeqlDn 0 1771 LM-3286.1 

SeqID n° 1772 LM-3288.1 

SeqID n° 1773 LM-329.1 

SeqlDn 0 1774 LM-3290.1 



From 514004 

From 514681 

From 515391 

From 516076 
From 123200 
From 516746 



to 514381 

to 515058 

to 515750 

to 516636 
to 123712 
to 518446 



SeqID n° 1775 LM-3292.1 From 518597 to 519700 



SeqlDn 0 1776 LM-3293.1 From 519789 to 520268 

SeqID n° 1 777 LM-3294.2 From 520426 to 520791 

SeqID n° 1778 LM-3296.2 From 1561538 to 1561882 

SeqlDn 0 1779 LM-3298.1 From 1559081 to 1561432 



SeqlDn 0 1780 LM-3299.1 From 1558570 to 1559091 

SeqID n° 1781 LM-330.1 From 122798 to 123187 

SeqlDn 0 1782 LM-3301.1 From 1556148 to 1558364 

SeqlDn 0 1783 LM-3302.1 From 1555680 to 1556132 

SeqlDn 0 1784 LM-3303.2 From 1554360 to 1555643 

SeqlDn 0 1785 LM-3305.2 From 2279524 to 2279919 

SeqlDn 0 1786 LM-3306.1 From 2280238 to 2281206 

SeqlDn 0 1787 LM-3307.1 From 2281203 to 2282279 

SeqlDn 0 1788 LM-3309.1 From 2282297 to 2283331 

SeqID n° 1789 LM-331.1 From 122104 to 122520 

SeqlDn 0 1790 LM-3310.1 From 2283331 to 2284260 

SeqlDn 0 1791 LM-3312.2 From 2284539 to 2286215 



Unknown, putative 
secreted protein 
Unknown, putative 
secreted protein 
Unknown, putative 
secreted protein 
Unknown 
antigen A 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
conserved hypothetical 
proteins, highly similar to B. 
subtilis YloN protein 
Unknown 
Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to single- 
stranded-DNA-specific 
exonuclease (RecJ) 
unknown, similar to 
adenine 
phosphoribosyltransferase 
antigen B 
unknown, similar to 
(p)ppGpp synthetase 
Unknown, similar to 
unknown proteins 
Unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
oligopeptide ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
oligopeptide ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
oligopeptide ABC 
transporter (permease) 
unknown, similar to Antigen 
C 

Unknown, similar to 
oligopeptide ABC 
transporter (permease) 
Unknown, similar to 
pheromone binding protein 
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SeqID n° 1792 

SeqID n° 1793 

SeqID n°1794 

SeqID n° 1795 

SeqID n° 1796 

SeqID n° 1797 

SeqID n° 1798 

SeqID n° 1799 
SeqID n° 1800 
SeqID n° 1801 



LM-3315.1 

LM-3316.1 

LM-3319.1 

LM-332.1 

LM-3320.1 

LM-3321.1 

LM-3322.1 

LM-3323.1 
LM-3326.1 
LM-3327.1 



From 1528098 

From 1528724 

From 1529454 

From 121664 

From 1530727 

From 1531398 

From 1531716 

From 1532129 
From 1532493 
From 1535439 



SeqID n° 1802 LM-3329.2 From 1536143 
SeqID n° 1803 LM-333.1 From 121112 



to 1528727 

to 1529377 

to 1530524 

to 122092 

to 1531353 

to 1531700 

to 1532132 

to 1532401 
to 1535132 
to 1536134 

to 1537648 
to 121447 



SeqID n° 1804 LM-3330.2 From 512291 to 513268 



SeqID n° 1805 
SeqID n° 1806 
SeqID n° 1807 
SeqID n° 1808 
SeqID n° 1809 
SeqID n° 1810 



LM-3331.1 
LM-3332.1 
LM-3333.1 
LM-3334.1 
LM-3335.2 
LM-3336.3 



From 511966 
From 511215 
From 510163 
From 509400 
From 507723 
From 506368 



to 512208 
to 511562 
to 511212 
to 510287 
to 508733 
to 506997 



SeqID n° 1811 LM-3337.3 From 505351 to 506223 

SeqID n° 1812 LM-3338.2 From 1487699 to 1489579 

SeqID n° 1813 LM-3339.1 From 1489599 to 1490039 

SeqID n° 1814 LM-334.1 From 120654 to 121106 

SeqID n° 1815 LM-3340.1 From 1490229 to 1491053 

SeqID n" 1816 LM-3341.1 From 1491087 to 1493153 

SeqID n° 1817 LM-3342.1 From 1493146 to 1494036 



unknown, similar to Uridine 
kinase 
Unknown, similar to O- 
methyltransferase 
Unknown, similar to 
unknown proteins 
unknown, similar to Antigen 
D 

Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
Unknown 
alanyl-tRNA synthetase 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 

transporter 
Unknwon, similar to 
putative repressor C1 from 
lactococcal bacteriophage 
Tuc2009 
Unknown, similar to 
oxetanocin A resistance 
protein oxrB 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, weakly similar to 
site-specific DNA- 
methyltransferase 
Unknown 
DNA primase 
Unknown, similar to 
unknown proteins 
Unknwon, similar to protein 
gp35 from Bacteriophage 
A118 
Unknown, similar to 
unknown protein 
Unknown, similar to glycyl- 
tRNA synthetase beta 
chain 

Unknown, similar to glycyl- 
tRNA synthetase alpha 
chain 
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SeqlDn°1818 LM-3343.1 From 1494317 to 1495084 

SeqlDn°1819 LM-3344.1 From 1495221 to 1495850 

SeqlDn°1820 LM-3345.1 From 1495899 to 1496804 

SeqlDn°1821 LM-3347.1 From 1496801 to 1497196 

SeqlDn°1822 LM-3348.1 From 1497221 to 1497616 

SeqlDn°1823 LM-3349.1 From 1497594 to 1498079 

SeqlDn°1824 LM-335.1 From 119740 to 120435 

SeqID n° 1825 LM-3355.1 From 2012582 to 201 3727 



SeqID n° 1826 

SeqID n°1827 

SeqID n° 1828 

SeqID n° 1829 
SeqID n° 1830 

SeqID n°1831 

SeqID n°1832 

SeqID n° 1833 

SeqID n°1834 

SeqID n°1835 



LM-3356.1 

LM-3357.1 

LM-3359.1 

LM-336.1 
LM-3360.1 

LM-3361.2 

LM-3362.2 

LM-3363.3 

LM-3364.2 

LM-3371.2 



From 2014086 

From 2014776 

From 2015819 

From 119063 
From 2016858 

From 2018258 

From 2019420 

From 2019686 

From 1867802 

From 1872724 



to 2014760 

to 2015738 

to 2016538 

to 119770 
to 2018261 

to 2019265 

to 2019644 

to 2020297 

to 1869460 

to 1873620 



SeqID n° 1836 LM-3373.2 From 1873764 to 1875116 



SeqID n° 1837 LM-3374.2 From 1875129 to 1875461 

SeqID n° 1838 LM-3375.2 From 1358834 to 1359103 

SeqID n° 1839 LM-3376.1 From 1359352 to 1361523 

SeqID n° 1840 LM-3377.1 From 1361564 to 1362604 



SeqID n° 1841 LM-3378.2 From 1362765 to 1363244 



Unknown, similar to B. 
subtilis RecO protein 
involved in DNA repair and 
homologous recombination 

Unknown 
Unknown, similar to GTP 
binding proteins 
Unknown, similar to 
cytidine deaminase 
Unknown, similar to 
diacylglycerol kinase 
Unknown, similar to 
unknown proteins 
Unknown, weakly similar to 
transcription regulators, 
Fnr/Crp family 
Unknown, similar to similar 
to ribosomal protein S1 like 
protein 
Unknown, similar to 

cytidylate kinase 
Unknown, similar to 

asparaginase 
Unknown, similar to 
unknown proteins 
Unknwon 
Unknown, similar to ATP- 
dependent DNA helicase 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

ferredoxin 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
protein-tyrosine 
phosphatase 
Unknown, similar to signal 
recognition particle protein 
Ffh 

Unknown, similar to 
unknown proteins 
ribosomal protein S15 
polynucleotide 
phosphorylase (PNPase) 

unknown, similar to 
conserved hypothetical 

proteins 
unknown, similar to B. 
subtilis YqzC protein 
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SeqlDn°1842 LM-3379.2 From 1363272 to 1363628 

SeqID n° 1843 LM-338.1 From 117841 to 118956 

SeqlDn°1844 LM-3380.2 From 1918681 to 1919505 

SeqID n° 1845 LM-3381.1 From 1919525 to 1920436 

SeqID n° 1846 LM-3382.1 From 1920436 to 1920900 

SeqID n° 1847 LM-3384.1 From 1920982 to 1922265 

SeqID n° 1848 LM-3385.1 From 1922439 to 1923809 

SeqID n° 1849 LM-3387.1 From 1923825 to 1924757 



SeqID n° 1850 LM-3389.1 From 1924754 to 1925596 

SeqID n° 1851 LM-3390.3 From 1925600 to 1926322 

SeqID n° 1852 LM-3391.3 From 262657 to 263262 

SeqID n° 1853 LM-3392.1 From 262064 to 262576 

SeqID n° 1854 LM-3393.1 From 261306 to 262061 

SeqID n° 1855 LM-3394.1 From 260896 to 261306 

SeqID n° 1856 LM-3398.1 From 259477 to 260892 

SeqID n° 1857 LM-340.1 From 1 16972 to 1 17805 

SeqID n° 1858 LM-3400.1 From 258856 to 259470 

SeqID n° 1859 LM-3403.3 From 256983 to 258458 

SeqID n° 1860 LM-3404.3 From 256491 to 256964 

SeqID n° 1861 LMr3405.2 From 1862459 to 1862893 

SeqID n° 1862 LM-3406.2 From 1862893 to 1863480 



unknown, similar to B. 
subtilis YqzD protein 
Unknown, similar to lipase 
unknown, similar to 
unknown proteins 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, highly similar to 
signal peptidase II 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
adhesion binding proteins 

and lipoproteins with 
multiple specificity for metal 

cations 
unknown, similar to integral 
membrane proteins, ABC 
transporter 
unknown, similar to 
probable ABC transporter, 

ATP-binding proteins 
RNA polymerase sigma-30 
factor (sigma-H) 
Unknown, similar to B. 
subtilis Yacp protein 
Unknown, similar to 
conserved hypothetical 
proteins like to B. subtilis 

YacO protein 
Unknown, highly similar to 

B. subtilis YazC protein 
cysteinyl-tRNA synthetase 

Unknwon, similar to 
transcriptional regulatory 

proteins, AraC family 
unknown, similar to serine 

O-acetyltransferase 
unknown, highly similar to 
glutamyl-tRNA synthetase 
Unknown, similar to B. 
subtilis YacN protein 
Unknown, similar to 
transcription regulator 
Unknown, weakly similar to 
Nad(P)h Oxidoreductase 
chain B 
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SeqlDn°1863 LM-3407.1 From 1863501 to 1864217 

SeqID n° 1864 LM-3408.1 From 1864248 to 1864637 
SeqlDn°1865 LM-3409.1 From 1864657 to 1865394 



SeqID n° 1866 LM-3410.1 From 1865394 to 1865912 



SeqID n° 1867 

SeqID n° 1868 

SeqID n° 1869 
SeqID n 6 1870 

SeqID n° 1871 

SeqID n° 1872 

SeqID n° 1873 

SeqID n° 1874 
SeqID n° 1875 

SeqID n° 1876 

SeqID n° 1877 

SeqID n° 1878 

SeqID n° 1879 

SeqID n° 1880 



LM-3411.1 

LM-3412.1 

LM-3414.2 
LM-3415.4 

LM-3416.3 

LM-3417.3 

LM-342.1 

LM-3420.4 
LM-3421.2 

LM-3422.1 

LM-3424.1 

LM.3426.1 

LM-3427.1 

LM-3428.1 



From 1865922 

From 1866336 

From 1867427 
From 1723649 

From 1723356 

From 1722431 

From 115087 

From 1716858 
From 1748792 

From 1749878 

From 1751283 

From 1752413 

From 1753095 

From 1753845 



to 1866308 

to 1867070 

to 1867699 
to 1724122 

to 1723598 

to 1723339 

to 116829 

to 1717160 
to 1749733 

to 1751176 

to 1752371 

to 1752940 

to 1753841 

to 1754942 



SeqID n° 1881 LM-3429.1 From 1755143 to 1756123 

SeqID n° 1882 LM-343.1 From 113313 to 115094 

SeqID n° 1883 LM-3430.2 From 1756156 to 1756617 

SeqID n° 1884 LM-3431.2 From 2891269 to 2891652 

SeqID n° 1885 LM-3432.2 From 2890779 to 2891171 

SeqID n° 1886 LM-3433.1 From 2890274 to 2890666 

SeqID n° 1887 LM-3434.1 From 2888994 to 2890277 

SeqID n° 1888 LM-3436.1 From 2888639 to 2889001 



Unknown, similar to 
unknown proteins 
Unknown 
Unknown, similar to E. coli 
tRNA (guanine-N1) 
methyltransferase 
Unknown, similar to 
putative 16S rRNA 
processing protein RimM 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
unknown proteins 
ribosomal protein S16 
unknown, some similarity 
to hypothetical proteins 
unknown, some similarity 
to hypothetical proteins 
unknown, similar to L- 
lactate dehydrogenases 
Unknown, ABC transporter, 
ATP-binding protein 

unknown 
unknown; similar to 
glycerate dehydrogenases 
glutamate-1-sernialdehyde 
aminotransferase 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
glucose 1-dehydrogenase 
unknown, similar to A/G- 
specific adenine 
glycosylate 
unknown, similar to 
hypothetical proteins 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
unknown, similar to 
deoxyuridine triphosphate 
nucleotidohydrolases 
Unknown, hypothetical 

secreted protein 
Unknown, hypothetical 

secreted protein 
Unknown, hypothetical 
secreted protein 
Unknown 
Unknown 
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SeqlDn°1889 LM-3439.2 From 2887499 to 2888215 
SeqID n° 1890 LM-3442.2 From 885440 to 886882 



SeqID n° 1891 LM-3444.1 From 886875 to 887603 

SeqlDn°1892 LM-3445.2 From 887652 to 889502 

SeqlDn°1893 LM-3447.3 From 2068977 to 2069648 

SeqlDn°1894 LM-3448.1 From 2069710 to 2070657 

SeqlDri°1895 LM-3449.1 From 2070754 to 2071170 

SeqlDn°1896 LM-345.1 From 112386 to 113288 

SeqID n° 1897 LM-3450.1 From 2071187 to 2072182 

SeqlDn°1898 LM-3451.1 From 2072205 to 2073269 

SeqlDn°1899 LM-3452.1 From 2073282 to 2074103 

SeqlDn°1900 LM-3453.1 From 2074084 to 2074902 

SeqlDn°1901 LM-3455.1 From 2074925 to 2075392 

SeqlDn°1902 LM-3456.2 From 2075412 to 20761 13 

SeqID n° 1 903 LM-3458.2 From 20761 1 5 to 2076846 

SeqlDn°1904 LM-3459.2 From 607372 to 608199 

SeqID n° 1905 LM-3460.1 From 608184 to 608480 

SeqID n° 1906 LM-34611 From 608557 to 609588 

SeqID n° 1907 LM-3462.2 From 609629 to 610924 

SeqID n° 1908 LM-3463.2 From 1776257 to 1776829 

SeqID n° 1909 LM-3465.1 From 1774991 to 1775983 



GidB protein 
unknown, similar to 
Glutamine binding and 
transport protein 
unknown, similar to amino 
acid ABC transporter, ATP- 
binding protein 
Unknown, similar to 

amidases 
Unknown, similar to 
deoxyribose-phosphate 
aldolase 
Unknown, similar to 
transcription repressor of 
dra/nupC/pdp operon 
DeoR 

Unknown, similar to PTS 
mannose-specific enzyme 
IIA component 
Unknown, similar to 
transcription regulator 
Unknown, similar to opine 

catabolism protein 
Unknown, weakly similar to 
glucosamine-fructose-6- 
phosphate 
aminotransferase 
Unknown, similar to 
mannose-specific enzyme 

II D component 
Unknown, similar to PTS 
mannose-specific enzyme 

IIC component 
Unknown, similar to PTS 
mannose-specific enzyme 
IIB component 
Unknown, similar to 
transcription regulator 

GntR family 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, similar histidinol 
phosphate phosphatase 
Unknown, similar to 
methyltransferase 
Unknown 
Unknown, conserved 
hypothetical protein 
unknown 
unknown, similar to cell- 
shape determining proteins 
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SeqlDn°1910 LM-3466.1 From 1773514 to 1774773 



SeqlDn°1911 LM-3468.3 From 1772178 to 1773410 



SeqID n° 1912 
SeqlDn°1913 

SeqID n°1914 

SeqID n° 1915 

SeqID n°1916 
SeqID n° 1917 



LM-3469.2 
LM-3473.1 

LM-3475.1 

LM-3479.2 

LM-3480.3 
LM-3481.2 



From 833117 
From 831025 

From 830259 

From 827666 

From 826752 
From 2011128 



to 833587 
to 833073 

to 830897 

to 828016 

to 827513 
to 2012438 



SeqID n° 1918 LM-3482.1 From 2010089 to 2011105 



SeqID n° 1919 LM-3483.2 From 2009006 to 2009986 

SeqID n° 1920 LM-3484.1 From 2008319 to 2008594 

SeqID n° 1921 LM-3485.1 From 2007515 to 2008084 

SeqID n° 1922 LM-3486.1 From 2006689 to 2007456 



SeqID n° 1923 LM-3487.1 From 2005953 to 2006666 



SeqID n° 1924 LM-3488.1 From 2004977 to 2005942 



SeqID n° 1925 LM-3489.2 From 2004515 to 2004958 

SeqID n° 1926 LM-349.1 From 110013 to 112283 

SeqID n° 1927 LM-3490.2 From 1029208 to 1029819 

SeqID n° 1928 LM-3491.1 From 1029965 to 1030756 

SeqID n° 1929 LM-3493.3 From 1030757 to 1032229 



unknown, similar to 
multidrug resistance 
protein, integral membrane 

protein 
unknown, highly similar to 
aminopeptidases 
Unknown 
Unknown, similar to 
putative Na+/H+ antiporter 
Unknown, weakly similar to 
GTP-pyrophosphokinase 
Unknown, similar to B. 
subtilis YqkB protein 

Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
NAD(P)H-dependent 
glycerol-3-phosphate 

dehydrogenase 
Unknown, similar to 
protein-tyrosine/serine 

phosphatase 
Unknown, similar to non- 
specific DNA-binding 
protein HU 
Unknown, similar to GTP 
cyclohydrolase I 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component I 
Unknown, similar to 2- 
heptaprenyl-1,4- 
naphthoquinone 
methyltransferase 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component II 
(menaquinone 
biosynthesis) 
Unknown, similar to 
nucleoside diphosphate 
kinase 

Unknown, highly similar to 
chitinase B 
unknown, similar to 
hypothetical protein 

unknown 
unknown, similar to 
PHYTOENE 
DEHYDROGENASE (EC 
1.3.-.-) (PHYTOENE 
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Seq1Dn°1930 LM-3494.3 From 1032333 to 1032617 

Seq!Dn°1931 LM-3495.3 From 1032924 to 1033190 

SeqIDn°1932 LM-3497.4 From 1033190 to 1034908 

SeqlDn°1933 LM-3498.1 From 1035021 to 1036055 

SeqID n° 1934 LM-35.1 From 2735279 to 2735953 

SeqIDn°1935 LM-3500.1 From 1036068 to 1036928 

SeqID n° 1936 LM-3501.2 From 1036971 to 1038116 

SeqID n° 1937 LM-3504.2 From 641208 to 642308 

SeqID n° 1938 LM-3505.1 From 642414 to 642914 

SeqID n° 1939 LM-3506.1 From 642962 to 643348 

SeqID n° 1940 LM-3507.1 From 643400 to 643744 

SeqID n° 1941 LM-3508.2 From 643891 to 645231 



SeqID n° 1942 LM-351.2 From 109349 to 109720 

SeqID n° 1943 LM-3510.2 From 2145652 to 2146044 

SeqID n° 1944 LM-3511.1 From 2146089 to 2146298 

SeqID n° 1945 LM-3512.1 From 2146449 to 2147426 



SeqID n° 1946 LM-3514.2 From 2147684 to 2149312 

SeqID n° 1947 LM-3517.2 From 1888615 to 1890273 

SeqID n° 1948 LM-3518.1 From 1887913 to 1888575 

SeqID n° 1949 LM-3519.1 From 1886887 to 1887777 

SeqID n° 1950 LM-352.2 From 108624 to 109256 

SeqID n° 1951 LM-3520.2 From 1884846 to 1886894 



unknown, similar to B. 
subtilis protein YkvS 
PHOSPHOCARRIER 
PROTEIN HPR 
(HISTIDINE-CONTAINING 
PROTEIN), 
phosphotransferase 

system enzyme I 
unknown, conserved 
hypothetical protein 
Unknown, similar to 
ribulose-5-phosphate 3- 

epimerase 
unknown, similar to 3- 
hydroxyisobutyrate 
dehydrogenase (B. subtilis 
YkwC protein) 
unknown, similar to 
aminotransferases (to B. 

subtilis PatA protein) 
UNknown, similar to cell 
surface protein 
Unknown, weakly similar to 
transcription regulator 

Unknown 
Unknown, similar to B. 
subtilis YvIA protein 
Unknown, conserved 
hypothetical membrane 
protein 
Unknwon 
Unknown 
Unknown 
Unknown, similar to 
conjugated bile acid 
hydrolase 
class I heat-shock protein 
(chaperonin) GroEL 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
phosphoglycerate 
dehydrogenase 
Unknown, similar to L- 
serine dehydratase 
Unknwon, similar to NADH 

oxidase 
Unknown, similar to ATP- - 
dependent DNA helicase 
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SeqID n° 1952 

SeqID n° 1953 

SeqID n°1954 

SeqID n°1955 

SeqID n° 1956 

SeqID n° 1957 
SeqID n°1958 
SeqID n° 1959 



LM-3523.1 

LM-3524.1 

LM-3525.1 

LM-3527-1 

LM-3528.2 

LM-353.1 
LM-3533.1 
LM-3534.1 



From 2143109 

From 2142480 

From 2141820 

From 2141071 

From 2140046 

From 108320 
From 1053426 
From 1054319 



to 2144734 

to 2143097 

to 2142461 

to 2141814 

to 2140963 

to 108610 
to 1054277 
to 1055284 



SeqID n° 


1960 


LM-3535.1 


From 1055393 


to 1057060 


SeqID n° 


1961 


LM-3537.1 


From 1057761 


to 1058531 


SeqID n° 


1962 


LM-3538.3 


From 1058580 


to 1059608 


SeqID n° 
SeqID n° 


1963 
1964 


LM-3539.1 
LM-354.1 


From 2286839 
From 107915 


to 2287183 
to 108208 


SeqID n° 


1965 


LM-3541.1 


From 2287518 


to 2288513 


SeqID n° 


1966 


LM-3543.1 


From 2288571 


to 2288987 


SeqID n° 


1967 


LM-3544.1 


From 2288971 


to 2289423 


SeqID n° 


1968 


LM-3547.1 


From 2289545 


to 2290786 


SeqID n° 


1969 


LM-3549.2 


From 2290989 


to 2291927 


SeqID n° 
SeqID n° 


1970 
1971 


LM-355.1 
LM-3551.1 


From 107499 
From 1242100 


to 107849 
to 1244508 


SeqID n° 


1972 


LM-3553.2 


From 1241048 


to 1242100 


SeqID n° 


1973 


LM-3555.2 


From 2305632 


to 2305994 


SeqID n° 


1974 


LM-3556.1 


From 2305180 


to 2305602 



recG 

Unknown, similar to copper 
export proteins 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
potassium channel subunit 
Unknown, similar to heme 
O oxygenase 
Unknwon 
unknown 
unknown, similar to B. 
subtilis LytR protein 
unknown, similar to 
conserved hypothetical 
proteins (in particular B. 
subtilis YkqC) 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
transcriptional regulator, 
Lad family 
Unknown 
Unknwon, similar to 
transcription regulator 
tryptophanyl-tRNA 

synthetase 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcription regulator 
Unknown, similar to 3- 
oxoacyl-acyl-carrier protein 
synthase 
Unknown, similar to 3- 
oxoacyl- acyl-carrier 
protein synthase 

Unknwon 
phenylalanyl-tRNA 
synthetase beta subunit 

phenylalany-tRNA 
synthetase beta subunit 
Unknown, similar to 

unknown protein 
Unknown, similar to 
histidine triad (HIT) protein 
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SeqlDn°1975 LM-3557.1 From 2304267 to 2305019 



SeqID n° 1976 LM-3558.1 From 2303060 to 2304283 



Seq!Dn°1977 LM-3559.1 From 2302518 to 2303021 

SeqID n° 1978 LM-356.1 From 106988 to 107377 

SeqID n° 1979 LM-3560.1 From 2301260 to 2302321 

SeqID n° 1980 LM-3561.1 From 2300334 to 2301263 

SeqID n° 1981 LM-3562.2 From 2299833 to 2300177 

SeqID n° 1982 LM-3563.2 From 1548034 to 1548456 

SeqID n° 1983 LM-3565.2 From 1548701 to 1549906 

SeqID n° 1984 LM-3566.1 From 1549920 to 1550285 

SeqID n° 1985 LM-3567.1 From 1550435 to 1550815 

SeqID n° 1986 LM-3569.1 From 1550854 to 1552629 

SeqID n° 1987 LM-357.1 From 105951 to 106862 



SeqID n° 1988 LM-3570.2 From 1552632 to 1553909 
SeqID n° 1989 LM-3571.4 From 1713234 to 1715099 

SeqID n° 1990 LM-3572.1 From 1712634 to 1713209 



SeqID n° 1991 LM-3573.2 From 171 1672 to 1712637 

SeqID n° 1992 LM-3574.3 From 896003 to 897028 

SeqID n° 1993 LM-3576.1 From 895278 to 895991 

SeqID n° 1994 LM-3577.1 From 893838 to 895211 



SeqID n° 1995 LM-3578.1 From 892662 to 893774 
SeqID n° 1996 LM-3579.1 From 892192 to 892512 



Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (membrane 
protein) 
Unknown, similar to 
unknown protein 
unknown 
Unknown, similar to 
uroporphyrinogen III 

decarboxylase 
Unknown, similar to 
ferrochelatase 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
ammonium transporter 
NrgA 
Unknown, similar to 
nitrogen regulatory Pll 
protein 
Unknown 
aspartyl-tRNA synthetase 
Unknown, similar to PTS 
system mannose-specific, 

factor I ID 
histidyl-tRNA synthetase 

unknown, similar to 
asparagine synthetase 

unknown, similar to 
conserved hypothetical 
protein 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
transcription regulator lad 
family 
unknown, similar to 
carboxylesterase 
UDP-N- 
acety Imura moy lala nyl-D- 

glutamyl-2,6- 
diaminopimelate-D-alanyl- 

D-alanyl ligase 
D-alanine-D-alanine ligase 
Unknown, similar to E. coli 
SugE protein 
(transmembrane 
chaperone) 
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SeqID n° 1 997 LM-358. 1 From 1 05123 

SeqID n° 1998 LM-3581.1 . From 891848 

SeqID n° 1999 LM-3582.1 From 891277 

SeqID n° 2000 LM-3583.3 From 890439 

SeqID n° 2001 LM-3585.2 From 1926935 

SeqID n° 2002 LM-3588.1 From 1928798 

SeqID n° 2003 LM-3589.1 From 1931026 

SeqID n° 2004 LM-359.1 From 104134 

SeqID n° 2005 LM-3590.2 From 1931442 

SeqID n° 2006 LM-3591.3 From 1932324 

SeqID n° 2007 LM-3592.2 From 2487261 

SeqID n° 2008 LM-3593.1 From 2488379 

SeqID n° 2009 LM-3596.1 From 2489274 

SeqID n° 2010 LM-3597.1 From 2489945 

SeqID n° 201 1 LM-3599.1 From 2491552 

SeqID n° 2012 LM-36.1 From 2735962 

SeqID n° 2013 LM-360.1 From 103204 

SeqID n° 2014 LM-3600.1 From 2492684 

SeqID n° 2015 LM-3601.3 From 2493454 

SeqID n° 2016 LM-3602.2 From 1435635 

SeqID n° 2017 LM-3603.1 From 1437943 



to 105929 Unknown, similar to PTS 
system mannose-specific, 
factor IIC 

to 892189 Unknown, similar to E. coli 
SugE protein 
(transmembrane 
chaperone) 
to 891 831 Unknown, similar to 
transcription regulator 
TetR/AcrR family 
to 891197 Unknown 
to 1 928425 unknown, similar to 
carboxy-terminal 
processing proteinase 
to 1931011 unknown, similar to heavy 
metal-transporting 
ATPases 

to 1931319 unknown, similar to 
conserved hypothetical 
proteins 

to 105099 Unknown, similar to PTS 
system mannose-specific, 
factor IIAB 
to 1932266 unknown, similar to similar 
to D-alanyl-D-alanine 
carboxypeptidases 
to 1933025 purine nucleoside 

phosphorylase 
to 2488337 Unknown 
to 2489209 Unknown, conserved * 

lipoprotein 
to 2489948 Unknown, similar to ABC 
transporter, permease 
protein 

to 2490967 Unknown, similar to ABC 
transporter (ATP-binding 
protein) 

to 2492694 Unknown, similar to two- 
component sensor histidine 
kinase 

to 2736402 Unknwon, similar to ribose 

5-phosphate epimerase 
to 103836 unknown 
to 2493379 Unknown, similar to two- 
component response 
regulator 

to 2494329 Unknown, conserved 
hypothetical protein 
to 1437866 pyruvate formate-lyase 
to 1438689 pyruvate-formate lyase 
activating enzyme 
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SeqlDn°2018 LM-3604.1 From 1438724 to 1439254 
SeqID n° 2019 LM-3605.2 From 1439421 to 1440614 
SeqlDn°2020 LM-3608.2 From 1104863 to 1105864 



SeqID n° 2021 LM-361.1 From 102468 to 103028 
SeqIDn°2022 LM-3612.1 From 1107975 to 1109663 



SeqID n° 2023 LM-3613.1 From 1109705 to 1110577 

SeqID n° 2024 LM-3614.3 From 1 110769 to 1 113627 

SeqID n° 2025 LM-3615.2 From 1277312 to 1278025 

SeqID n° 2026 LM-3616.1 From 1278056 to 1279702 

SeqID n° 2027 LM-3617.3 From 1279721 to 1281205 

SeqID n° 2028 LM-3618.3 From 1434661 to 1435209 

SeqID n° 2029 LM-362.1 From 102155 to 102478 

SeqID n° 2030 LM-3620.2 From 1432839 to 1434644 

SeqID n° 2031 LM-3623.1 From 1430237 to 1432819 

SeqID n° 2032 LM-3624.2 From 1429765 to 1430127 

SeqID n° 2033 LM-3625.2 From 758656 to 759396 

SeqID n° 2034 LM-3626.2 From 756738 to 758543 

SeqID n° 2035 LM-3627.2 From 598020 to 598925 

SeqID n° 2036 LM-3628.2 From 598968 to 600344 



Unknown, similar to 
unknown proteins 
Unknown, similar to 
multidrug-efflux transporter 
unknown, similar to 
TEICHOIC ACID 
TRANSLOCATION ATP- 
BINDING PROTEIN TAGH 
(ABC transporter) 

unknown 
unknown, similar to 
TEICHOIC ACID 
BIOSYNTHESIS PROTEIN 
B PRECURSOR 
unknown, similar to 
putative UDP-glucose 
pyrophosphorylases 
unknown, similar to B. 
subtilis YfhO protein 
Unknown, similar to 
transcription regulator 

GntR family 
Unknown, similar to 

alpha, alpha- 
phosphotrehalase 
Unknown, similar to PTS 
system trehalose specific 
enzyme IIBC 
Unknown, similar to 
putative anti-terminator 

regulatory protein 
Unknwon, similar to ATP 
synthase epsilon chain 
DNA mismatch repair 

protein 
DNA mismatch repair 

(recognition) 
Unknown, similar to B. 
subtilis YmcA protein 
Unknown, similar to 
riboflavin kinase / FAD 

synthase 
Unknown, similar to L- 
gIutamine-D-fructose-6- 
phosphate 
amidotransferase 
• Unknown, putative 
membrane protein 
Unknown, similar to NADP- 
specific glutamate 
dehydrogenase 



WO 01/77335 



141 



PCT/FR01/01118 



SeqID n° 2037 LM-363.1 From 100772 to 102142 

SeqID n° 2038 LM-3630.4 From 134782 to 136290 

SeqID n° 2039 LM-3631.2 From 133961 to 134710 

SeqID n° 2040 LM-3632.2 From 2292047 to 22931 74 

SeqID n° 2041 LM-3633.1 From 2293775 to 2294464 

SeqID n° 2042 LM-3635.2 From 2294555 to 2297155 



SeqID n° 2043 LM-3636.2 From 1281321 to 1281773 

SeqID n° 2044 LM-3637.1 From 1281820 to 1282284 

SeqID n° 2045 LM-3638.1 From 1282473 to 1283393 

SeqID n° 2046 LM-3639.1 From 1283413 to 1284660 

SeqID n° 2047 LM-364.1 From 99902 to 100771 

SeqID n° 2048 LM-3640.1 From 1284644 to 1285474 

SeqID n° 2049 LM-3643.3 From 1285610 to 1286749 

SeqID n° 2050 LM-3644.4 From 1286830 to 1287225 



SeqID n° 2051 LM-3645.2 From 1765915 to 1767294 
SeqID n° 2052 LM-3647.1 From 1765499 to 1765900 



SeqID n° 2053 LM-3648.1 From 17651 17 to 1765470 
SeqID n° 2054 LM-3649.1 From 1763941 to 1764843 



SeqID n° 2055 LM-3650.1 From 1763209 to 1763751 



SeqID n° 2056 LM-3652.2 From 1762200 to 1763165 



SeqID n° 2057 LM-3656.2 From 2840286 to 2841896 



SeqID n° 2058 LM-3657.2 From 2839698 to 2840228 

SeqID n° 2059 LM-3658.1 From 1595227 to 1597149 
SeqID n° 2060 LM-3659.1 From 1597499 to 1598422 



Unknown, similar to ATP 

synthase beta chain 
Unkown, similar to inosine 
monophosphate 
dehydrogenase 
Unknwon, conserved 
hypothetical protein 
Unknown, similar to N- 
acetylmuramoyl-L-alanine 
amidase and to intemalin B 

Unknown, similar to 
phosphoglyceromutase 1 

Unknown, similar to 
endopeptidase Clp ATP- 
binding chain B (CIpB) 
unknown 
unknown 
unknown 
gamma-glutamyl 
phosphate reductase 
unknwon, similar to ATP 
synthase gamma chain 
gamma-glutamyl kinase 
unknown 
unknown, similar to 
transcriptional regulator 
(phage-related) 
unknown, similar to similar 
to RNA methyltransferases 
unknown, similar to 
glutathione transferase - 
fosfomycin resistance 
protein 
unknown 
unknown, some similarities 
to methyl-accepting 
chemotaxis proteins 
unknown, similar to 
ribosomal-protein-alanine 
N-acetyltransferase 
unknown, similar to 
putative transmembrane 

proteins 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
unknown protein 
threonyl-tRNA synthetase 
primosome component 
(helicase loader) Dnal 
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SeqlDn°2061 LM-366.1 From 98409 to 99905 
SeqlDn°2062 LM-3660.2 From 1598432 to 1599808 



SeqID n° 2063 

SeqID n° 2064 

SeqID n° 2065 

SeqID n° 2066 
SeqID n° 2067 
SeqID n' 2068 
SeqID n' 2069 
SeqID n° 2070 

SeqID n° 2071 
SeqID n° 2072 



LM-3662.3 

LM-3663.2 

LM-3664.1 

LM-3666.1 
LM-3667.1 
LM-3668.1 
LM-3669.1 
LM-367.1 

LM-3671.2 
LM-3673.2 



From 2708276 

From 2707235 

From 2706415 

From 2705746 
From 2705082 
From 2704433 
From 2704149 
From 97381 

From 2703275 
From 2033587 



to 2709358 

to 2708194 

to 2707215 

to 2706054 
to 2705711 
to 2705056 
to 2704433 
to 98412 

to 2704108 
to 2034528 



SeqID n° 2073 LM-3674.1 From 2032460 to 2033485 

SeqID n" 2074 LM-3676.2 From 2031438 to 2032460 

SeqID n° 2075 LM-3677.2 From 1756864 to 1757673 

SeqID n° 2076 LM-3678.1 From 1757771 to 1758673 

SeqID n° 2077 LM-3679.2 From 1758694 to 1761291 



SeqID n° 2078 

SeqID n° 2079 

SeqID n° 2080 
SeqID n° 2081 
SeqID n° 2082 
SeqID n° 2083 
SeqID n° 2084 

SeqID n° 2085 

SeqID n° 2086 

SeqID n° 2087 

SeqID n° 2088 



LM-368.1 

LM-3680.2 

LM-3681.2 
LM-3684.3 
LM-3692.1 
LM-3693.1 
LM-3694.1 

LM-3695.2 

LM-3696.1 

LM-3697.2 

LM-3698.2 



From 97127 

From 1761318 

From 1708312 
From 1708857 
From 2161161 
From 2160538 
From 2160043 

From 980177 

From 981743 

From 982957 

From 983768 



to 97369 

to 1761992 

to 1708518 
to 1711268 
to 2162054 
to 2161164 
to 2160432 

to 981307 

to 982960 

to 983646 

to 984814 



Unknwon, similar to ATP 

synthase alpha chain 
chromosome replication 

initiation / membrane 
attachment protein DnaB 
Unknown, conserved 
hypothetical lipoprotein 
Unknown, weakly similar to 
E. coli MenA protein 
Unknown, similar to B. 
subtilis YbaF protein 
ribosomal protein S10 
ribosomal protein L3 
ribosomal protein L4 
ribosomal protein L23 
Unknwon, weakly similar to 
ATP synthase delta chain 
ribosomal protein L2 
Unknown, similar to 
ferrichrome binding protein 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
unknown, similar to 
hypothetical proteins 
unknown, similar to CDP- 
abequose synthase 
unknown, similar to 
putative membrane 
proteins 
Unknwon, similar to ATP 
synthase C chain 
unknown, similar to 
unknown proteins 
unknown 
leucyl-tRNA synthetase 
Unknown 
unknown 
Unknown, similar to 
unknown proetin 
Unknown, similar to B. 
subtilis ComEC protein 
unknown, hypothetical 
transport protein 
unknown, similar to 
transcription regulator 
unknown, conserved 
hypothetical membrane 
protein 
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From 2658024 


to 2658851 


From 94764 
From 1936764 


to 97070 
to 1937603 


From 1935964 


to 1936749 


From 1935334 


to 1935948 


From 1934765 


to 1935298 


From 1934321 


to 1934758 


From 1933333 


to 1934238 


From 822611 


to 823531 


From 821886 


to 822527 


From 821070 


to 821771 


From 820145 


to 821035 


From 1767359 


to 1767739 


From 1767846 


to 1768490 


From 1768520 


to 1769389 


From 1769720 


to 1770526 


From 1770648 


to 1771406 


From 1818007 
From 1817112 


to 1818516 
to 1817879 


From 1815143 


to 1817122 


From 1293088 


to 1293654 



Unknown, conserved 
hypothetical protein 

unknwon 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
peptidyl methionine 
sulfoxide reductases 
unknown, similar to 
transcriptional regulator 
(PilB family) 
unknown, similar to 
dehydogenases and 
hypothetical proteins 
Unknown, conserved 
hypothetical protein 
Unknown, similar to B, 
subtilis YwnB protein 
Unknown, similar to 
conserved hypothetical 
protein 
Unknown, similar to 
conserved hypothetical 
protein 
unknown, similar to 
conserved hypothetical 
proteins 
unknown, similar to 
deoxyguanosine 
kinase/deoxyadenosine 
kinase(l) subunit 
unknown, similar to 
transport proteins 
unknown, similar to 
aminoglycoside N3 - 
acetyltransferases 
unknown, similar to 

methionine 
aminopeptidases 
Unknown 
Unknown, similar to ABC 
transporter (ATP-binding 

protein) 
Unknown, similar to ABC 
transporter (permease) 
unknown, similar to type-l 
signal peptidase 
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SeqlDn°2110 LM-3722.1 From 1291710 to 1292969 



SeqlDn°2111 LM-3724.1 From 1290241 to 1291524 



SeqID n°2112 
SeqID n°2113 



LM-3725.2 
LM-3726.3 



From 1289188 
From 1703330 



to 1290126 
to 1703824 



SeqID n° 21 14 LM-3727.1 From 1704618 to 1705181 



SeqID n°2115 
SeqID n°2116 
SeqID n°2117 

SeqID n°2118 
SeqID n°2119 



LM-3728.1 
LM-373.1 
LM-3732.2 

LM-3733.3 
LM-3735.3 



From 1705199 
From 88888 
From 1706204 

From 1707168 
From 1645412 



to 1705630 
to 94767 
to 1707088 

to 1707917 
to 1645936 



SeqID n° 2120 LM-3736.3 From 1644106 to 1645191 



SeqID n°2121 
SeqID n° 2122 
SeqID n° 2123 
SeqID n° 2124 

SeqID n° 2125 

SeqID n° 2126 

SeqID n°2127 

SeqID n°2128 

SeqID n°2129 
SeqID n°2130 

SeqID n°2131 

SeqID n° 2132 

SeqID n°2133 
SeqID n°2134 



LM-3737.1 
LM-3739.2 
LM-374.1 
LM-3746.2 

LM-3747.1 

LM-3749.2 

LM-375.1 

LM-3750.2 

LM-3754.2 
LM-3756.2 

LM-3757.2 

LM-3758.2 

LM-3759.2 
LM-376.1 



From 1642865 
From 1641284 
From 88121 
From 1428442 

From 1426766 

From 1425419 

From 86747 

From 356890 

From 360172 
From 1407859 

From 1406742 

From 824936 

From 824423 
From 86323 



to 1643872 
to 1642543 
to 88810 
to 1428939 

to 1428328 

to 1426465 

to 87730 

to 357705 

to 360507 
to 1408818 

to 1407818 

to 826396 

to 824884 
to 86691 



SeqID n° 2135 LM-3763.1 From 1659457 to 1660926 

SeqID n° 2136 LM-3764.1 From 1660923 to 1661363 

SeqID n° 2137 LM-3766.3 From 1661588 to 1662457 

SeqID n° 2138 LM-3767.2 From 2212384 to 2212620 



ATP-dependent Clp 
protease ATP-binding 

subunit CIpX 
trigger factor (prolyl 
isomerase) 
unknown 
unknown, putative 
cellsurface protein 
unknown, similar to 
unknown proteins 
unknown 
Unknown 
similar to translation 
elongation factor 
30S ribosomal protein S2 
Unknown, similar to 
general stress protein 
3-deoxy-D-arabino- 
heptulosonate 7-phosphate 

synthase 
catabolite control protein A 
tyrosyl-tRNA synthetase 

unknwon 
Unknown, similar to N- 
acetyltransferase 
Unknown, similar to 
unknown protein 
Recombination protein 
recA 

Unknwon, similar to 

oxidoreductases 
Unknown, similar to 
transposase 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown, similar to lysine- 
specific permease 

Unknown 
Unknown, similar to 
transcription regulator 

(merR family) 
Unknown, similar to 
multidrug-efflux transporter 
Unknown, similar to 
transcription regulator 
MarR family 
D-Amino Acid 
Aminotransferase 
unknown 
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SeqlDn°2139 LM-3768.2 From 2213218 to 2215041 



SeqlDn°2140 LM-3769.2 From 2306177 
SeqID n° 2141 LM-377.1 From 85902 
SeqlDn°2142 LM-3770.2 From 2306833 



SeqID n° 2143 LM-3772.2 From 2307755 

SeqID n° 2144 LM-3774.3 From 991810 

SeqID n° 2145 LM-3775.1 From 990968 

SeqID n° 2146 LM-3776.2 From 990248 



SeqID n° 2147 LM-3778.2 From 1861907 

SeqID n° 2148 LM-3779.3 From 1860200 

SeqID n° 2149 LM-378.1 From 85025 

SeqID n° 2150 LM-3783.2 From 989099 



SeqID n° 2151 LM-3785.2 From 988065 

SeqID n° 2152 LM-379.1 From 84451 

SeqID n° 2153 LM-3790.3 From 1420076 

SeqID n° 2154 LM-3791.2 From 1778415 

SeqID n° 2155 LM-3792.1 From 1777763 

SeqID n° 2156 LM-3793.2 From 1776910 

SeqID n° 2157 LM-3795.3 From 1421454 

SeqID n° 2158 LM-3797.2 From 1422236 

SeqID n° 2159 LM-3798.1 From 1423255 



to 2306716 
to 86228 
to 2307714 



to 2308696 
to 992865 

to 991690 
to 990952 



to 1862251 
to 1861090 
to 85627 
to 990232 



to 988910 
to 84849 
to 1421362 

to 1779482 

to 1778338 

to 1777578 
to 1422185 

to 1423165 
to 1423833 



SeqID n° 2160 LM-380.1 From 82959 to 84437 



Unknown, similar to 
unknown protein 
Unknown 
Unknwon 
Unknown, similar to post- 
translocation molecular 

chaperone 
Unknown, similar to S. 
aureus Cbf1 protein 
unknown, similar to 
UNDECAPRENYL- 
PHOSPHATE N- 
ACETYLGLUCOSAMINYL 
TRANSFERASE 
unknown, similar to 
transcription regulator 
(GntR family) 
GLUCOSAMINE-6- 
PHOSPHATE 
ISOMERASE (EC 5.3.1.10) 
(GLUCOSAMINE-6- 
PHOSPHATE 
DEAMINASE) (GNPDA) 
(GLCN6P DEAMINASE), 
ribosomal protein L19 
intemalin C 
Unknwon 
N- 

ACETYLGLUCOSAMINE- 
6-PHOSPHATE 
DEACETYLASE (EC 
3.5. 1.25) (GLCNAC6-P 
DEACETYLASE). 
unknown 
Unknwon 
Unknown, similar to 
putative protease 
unknown, similar to 
hypothetical proteins 
unknown, similar to 
putative transcription 

regulators 
unknown, similar to 
hypothetical proteins 
Unknown, similar to 3- 
ketoacyl-acyl carrier protein 
reductase 
Unknown, similar to 

unknown protein 
Unknown, similar to 
phosphatidylglycerophosph 
ate synthase 
Unknwon 
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SeqID h° 2161 LM-3800.3 From 1423902 to 1425146 



SeqID n° 2162 LM-3801.3 From 984823 to 985722 
SeqID n° 2163 LM-3802.4 From 985743 to 986564 
SeqID n° 2164 LM-3806.3 From 1578625 to 1579425 



SeqID n 6 2165 LM-3807.3 From 1577212 to 1578573 



SeqID n° 2166 
SeqID n° 2167 



LM-3808.2 
LM-3809.2 



From 2298927 
From 2298093 



to 2299712 
to 2298863 



SeqID n° 2168 LM-3810.3 From 333534 to 335195 



SeqID n° 2169 
SeqID n° 2170 



LM-3811.3 
LM-3812.3 



From 335350 
From 1662570 



to 336426 
to 1663982 



SeqID n° 2171 LM-3818.1 From 2211280 to 2212245 



SeqID n° 2172 
SeqID n° 2173 


LM-3819.1 
LM-382.1 


From 2210609 
From 81661 


to 221 1244 
to 82617 


SeqID n° 2174 


LM-3821.1 


From 2159697 


to 2160053 


SeqID n° 2175 


LM-3822.1 


From 2149348 


to 2149632 


SeqID n° 2176 


LM-3823.3 


From 2943120 


to 2943479 


SeqID n° 2177 
SeqID n° 2178 


LM-3825.4 
LM-3826.1 


From 2943871 
From 1240358 


to 2944341 
to 1240681 


SeqID n° 2179 


LM-383.1 


From 80892 


to 81530 


SeqID n° 2180 


LM-3845.2 


From 979762 


to 980064 


SeqID n° 2181 


LM-3846.1 


From 979059 


to 979529 


SeqID n° 2182 


LM-3847.1 


From 940316 


to 940696 


SeqID n° 2183 


LM-3848.1 


From 1357753 


to 1358697 


SeqID n° 2184 


LM-385.1 


From 79817 


to 80869 


SeqID n° 2185 


LM-3851.1 


From 1378840 


to 1379901 



Unknown, similar to 
competence-damage 
inducible protein CinA 
unknown 
unknown 
unknown, highly similar to 

cell division inhibitor 
(septum placement) protein 
MinD 
Unknown, similar to 
ribonuclease G 
Unknown 
Unknown, similar to 

unknown protein 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to Xaa- 
His dipeptidase 
Unknown, similar to 
transcription regulator, Lad 
family 
Unknown 
Unknown, similar to 
phosphoglycerate 
dehydrogenase 
Unknown, similar to 
unknown protein 
class I heat-shock protein 
(chaperonin) GroES 
ribonuclease P protein 

component 
hypothetical protein 
unknown, similar to 

unknown protein 
Unknwon, conserved 
hypothetical protein 
unknown, similar to B. 
subtilis YneR protein 
non-heme iron-binding 

ferritin 
unknown, conserved 
hypothetical protein 
unknown, highly similar to 
riboflavin kinase and FAD 

synthase 
Unknwon, similar to E. coli 
Ada protein (06- 
methylguanine-DNA 
methyltransferase) 
Unknown, similar to 
aminopeptidase P 
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SeqID n° 2186 LM-3853.2 From 501693 to 501959 

SeqID n° 2187 LM-3856.2 From 1953836 to 1954237 

SeqID n° 2188 LM-3857.1 From 1963240 to 1963605 

SeqID n° 2189 LM-3858.2 From 2030744 to 2031196 

SeqID n° 21 90 LM-386. 1 From 79047 to 79820 

SeqID n° 2191 LM-3863.2 From 2494891 to 2495268 

SeqID n° 2192 LM-3865.2 From 200094 to 200402 

SeqID n° 2193 LM-3867.1 From 1890288 to 1890653 

SeqID n° 2194 LM-3868.2 From 1428945 to 1429748 

SeqID n° 2195 LM-3869.2 From 1440748 to 1441296 

SeqID n° 2196 LM-387.1 From 78362 to 7881 1 

SeqID n° 2197 LM-3871.1 From 2100845 to 2101366 

SeqID n° 2198 LM-3872.1 From 2100224 to 2100751 

SeqID n° 2199 LM-3873.1 From 2891649 to 2892032 

SeqID n° 2200 LM-3876.1 From 520961 to 521536 

SeqID n° 2201 LM-3878.2 From 640625 to 641215 

SeqID n° 2202 LM-3879.2 From 640300 to 640632 

SeqID n° 2203 LM-388.1 From 78090 to 78374 

SeqID n° 2204 LM-3883.1 From 1664057 to 1664470 



SeqID n° 2205 LM-3886.1 From 1702695 to 1703315 

SeqID n° 2206 LM-3887.1 From 1038336 to 1038557 

SeqID n° 2207 LM-3889.1 From 2524168 to 2524911 

SeqID n° 2208 LM-389.1 From 77149 to 77406 

SeqID n° 2209 LM-3890.1 From 2523835 to 2524068 

SeqID n° 2210 LM-3891.1 From 1724196 to 1724462 

SeqID n° 221 1 LM-3893.1 From 611240 to 612634 

SeqID n° 2212 LM-3896.1 From 1546608 to 1547885 



Unknown, weakly similar to 

transposase 
unknown, similar to similar 
to RNase HI 
unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to 
transcriptional regulator 
(Fur family) 
Unknwon, similar to 
carboxyphosphonoenolpyr 
uvate phosphonomutase 
Unknown, similar to glycine 
cleavage system protein H 
Unknwon, similar to B. 
subtilis SpoVG protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknown 
Unknown, similar to 
unknown protein 
Unknown, similar to cell- 
division initiation protein 

(septum placement) 
Unknown, hypothetical 
secreted protein 
Unknown 
Unknown 
Unknown, conserved 
hypothetical protein 
unknown 
Unknown, weakly similar to 
E. coli MutT protein (dGTP 
pyrophosphohydrolase 
unknown, putative 
cellsurface protein 

unknown 
Unknown, similar to 
carboxylesterase 
unknown 
Unknown 
unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to beta- 
glucosidase 
Unknown, similar to 
unknown protein 
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SeqlDn°2213 LM-3897.3 From 1514257 to 1515288 

SeqID n° 2214 LM-3898.1 From 1594535 to 1595119 

SeqID n° 2215 LM-39.1 From 2736597 to 2737628 

SeqID n° 2216 LM-390.3 From 76262 to 76984 

SeqlDn°2217 LM-3902.4 From 1715236 to 1716435 



SeqID n°2218 
SeqID n°2219 
SeqID n° 2220 
SeqID n° 2221 

SeqID n° 2222 

SeqID n° 2223 

SeqID n° 2224 

SeqID n e 2225 



SeqID n° 2226 
SeqID n° 2227 
SeqID n° 2228 
SeqID n° 2229 
SeqID n° 2230 
SeqID n°2231 



LM-3904.3 
LM-3905.2 
LM-3907.1 
LM-3908.1 

LM-3910.2 

LM-3912.1 

LM-3914.1 

LM-3915.1 



LM-392.3 
LM-3922.2 
LM-3924.2 
LM-3929.1 
LM-393.3 
LM-3934.1 



From 819389 
From 502899 
From 1234779 
From 1234465 

From 1409070 

From 2709522 

From 2297304 

From 2145127 



From 75832 
From 987735 
From 986722 
From 833615 

From 75342 
From 356567 



to 820039 
to 504629 
to 1235330 
to 1234776 

to 1409996 

to 2710421 

to 2297981 

to 2145513 



to 76125 
to 988049 
to 987180 
to 834076 

to 75665 
to 356872 



SeqID n° 2232 LM-3942.3 From 2633930 to 2634850 

SeqID n° 2233 LM-3943.4 From 1576107 to 1576397 

SeqID n° 2234 LM-3944.4 From 1576411 to 1576728 

SeqID n° 2235 LM-3947.1 From 1599814 to 1600278 

SeqID n° 2236 LM-3949.2 From 1638420 to 1638908 

SeqID n° 2237 LM-395.2 From 74239 to 75228 



SeqID n" 2238 LM-3950.2 From 1639120 to 1639722 

SeqID n° 2239 LM-3951.1 From 1640225 to 1641004 

SeqID n° 2240 LM-3953.1 From 502424 to 502660 

SeqID n° 2241 LM-3954.2 From 504674 to 504997 



Unknown, similar to 

unknown protein 
Unknown, similar to 
hypothetical GTP binding 

protein 
Unknown, similar to polyol 
dehydrogenase 
unknown 
Unknown, similar to S- 

methionine 
adenosyltransferase 
Unknown 
Unknown 
unknown 
unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, conserved 

lipoprotein 
Unknown, similar to 
unknown protein 
Unknown, similar to large 

conductance 
mechanosensitive channel 
protein 
unknown 
Unknown 
Unknown 
unknown 
Unknwon 
Unknown, similar to 

transposase 
Unknown, conserved 
hypothetical protein 
ribosomal protein L27 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
dinitrogenase reductase 
ADP-ribosylation system 
ribosomal protein S4 
Unknown 
Hypothetical orf 
Unknown 
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SeqlDn°2242 LM-3955.1 From 1307763 to 1308170 



SeqlDn°2243 LM-3957.1 From 2360908 to 2361141 
SeqlDn°2244 LM-3958.1 From 2360435 to 2360713 



SeqlDn°2245 LM-3959.2 From 2359948 to 2360184 

SeqlDn°2246 LM-396.2 From 1859272 to 1859787 

SeqlDn°2247 LM-3961.1 From 2494522 to 2494806 

SeqID n° 2248 LM-3966.4 From 1739674 to 1740849 



SeqlDn°2249 LM-397.3 From 1858651 to 1859010 
SeqlDn°2250 LM-3970.1 From 1933037 to 1933270 

SeqlDn°2251 LM-3972.2 From 818789 to 819265 



SeqlDn°2252 LM-3973.2 From 1136900 to 1137400 
SeqlDn°2253 LM-3976.4 From 267530 to 267916 



SeqID n° 2254 
SeqID n° 2255 
SeqID n° 2256 

SeqID n° 2257 

SeqID n° 2258 

SeqID n° 2259 

SeqID n° 2260 
SeqID n° 2261 



LM-3978.3 
LM-3979.3 
LM-398.1 

LM-3981.1 

LM-3982.1 

LM-3984.1 

LM-399.1 
LM-3990.1 



From 267010 
From 266431 
From 1857856 

From 2361897 

From 2362306 

From 270372 

From 1857242 
From 222983 



to 267372 
to 266931 
to 1858611 

to 2362274 

to 2362656 

to 270977 

to 1857772 
to 223261 



SeqID n° 2262 LM-3993.1 From 200522 to 200830 



SeqID n° 2263 
SeqID n° 2264 



LM-3995.1 
LM-3998.1 



From 143016 
From 823638 



to 143258 
to 824168 



SeqID n° 2265 LM-3999.3 From 810985 to 811620 



SeqID n° 2266 LM-4.1 From 2713398 to 2713940 



unknown, conserved 
hypothetical protein, similar 
to B. subtilis YneT protein 

Unknown 
Unknown, similar to 
competence transcription 
factor ComK, N. terminal 
part 
Unknown 
translation initiation factor 
IF-3 

Unknown, similar to 

thioredoxin 
unknown, similar to 
transmembrane transport 
proteins 
ribosomal protein L20 
unknown, similar to 
hypoyhetical protein 
Unknown, similar to 
transcription regulator 
(EbsC from Enterococcus 
faecalis) 
Unknown, similar to 
lipoprotein signal peptidase 
Unknown, similar to 
repressor (penicilinase 

repressor) 
ribosomal protein L12 
ribosomal protein L10 
Unknown, similar to 3 -exo- 
deoxyribonuclease exoA 
protein gp30 
[Bacteriophage A1 18] 

protein gp29 
[Bacteriophage A1 18] 
Unknown, conserved 
hypothetical protein 
Unknown 
Unknwon, highly similar to 
B. subtilis YabO protein 
Unknown, similar to B. 
subtilis SpoVG protein 

unknown 
Unknown, conserved 
hypothetical protein 
Unknown, similar to acyl- 

carrier protein 
phosphodiesterase and to 
NAD(P)H dehydrogenase 
Unknown 
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SeqlDn°2267 LM-40.1 From 2737630 to 2738682 

SeqlDn°2268 LM-4003.2 From 1513917 to 1514171 
SeqlDn°2269 LM-401.1 From 1855952 to 1857184 



SeqlDn°2270 LM-4013.2 From 1926482 to 1926898 



SeqlDn°2271 LM-4015.1 From 1771594 to 1772040 



SeqlDn°2272 LM-402.1 From 1855571 
SeqlDn°2273 LM-4027.2 From 263535 



SeqlDn°2274 LM-4029.2 From 263844 
SeqlDn°2275 LM-403.1 From 1854748 



SeqID n° 2276 
SeqID n° 2277 
SeqID n° 2278 
SeqID n° 2279 



LM-4031.1 
LM-4032.1 
LM-4033.1 
LM-404.1 



From 264432 
From 265029 
From 265494 
From 1854077 



SeqID n° 2280 LM-4040.1 From 504972 
SeqID n° 2281 LM-4041 . 1 From 552563 



SeqID n° 2282 
SeqID n° 2283 



SeqID n° 2284 
SeqID n° 2285 
SeqID n° 2286 



LM-4053.1 From 1270465 
LM-4055.2 From 1288261 



SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 
SeqID 



n° 2287 
n° 2288 
n° 2289 
n° 2290 
n° 2291 
2292 
2293 
2294 
2295 
2296 



LM-4058.1 
LM-4059.1 
LM-4065.1 



LM-4082.1 
LM-4083.1 
LM-4084.1 
LM-4088.1 
LM-4089.1 
LM-4090.1 
LM-4091.1 
LM-4096.1 
LM-4097.1 
LM-4098.1 



From 1405945 
From 1406366 
From 2045765 



From 2782739 
From 2780577 
From 2780392 
From 2701254 
From 2700963 
From 2699383 
From 2697267 
From 367248 
From 370416 
From 389538 



to 1855933 
to 263714 



to 264377 
to 1855521 



to 264902 
to 265454 
to 266183 
to 1854682 

to 505277 
to 552931 

to 1270725 
to 1288926 



to 1406226 
to 1406713 
to 2046046 



to 2782894 
to 2780798 
to 2780574 
to 2701445 
to 2701226 
to 2699568 
to 2697446 
to 367532 
to 370634 
to 389717 



SeqID n° 2297 LM-4099.1 From 401386 to 401580 



Unknown, similar to sorbitol 
dehydrogenase 
ribosomal protein S20 
Unknown, similar to 
aminotripeptidase 

(peptidase T) 
unknown, similar to 
transcriptional regulator 
(MarR family) 
unknown, similar to 
putative flavodoxin 
Unknown 
unknown, highly similar to 
preprotein translocase 
subunit 
transcription 
antitermination factor 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown 
ribosomal protein L1 1 
ribosomal protein L1 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
unknown protein 
unknown 
unknown, weakly similar to 
oligopeptide ABC 
transporter AppA 
Unknown 
Unknown 
Unknown, similar to pentitol 
PTS system enzyme II B 
component 
Unknown 
Unknown 
Unknown 
ribosomal protein L29 
ribosomal protein S17 
ribosomal protein S14 
ribosomal protein L30 
Unknown 
Unknown 
Unknown, similar to 
conserved hypothetical 
protein 
unknown 



WO 01/77335 



PCT/FRO 1/01118 



151 



SeqlDn°2298 LM-41.1 From 2738719 to 2739990 



SeqlDn°2299 LM-4106.1 From 439009 to 439212 



SeqID n° 2300 LM-4113.1 From 521601 to 521771 

SeqlDn°2301 LM-4115.1 From 616742 to 616951 

SeqID n° 2302 LM-4118.1 From 669086 to 669361 

SeqlDn°2303 LM-4119.1 From 696606 to 696815 

SeqlDn°2304 LM-4120.1 From 707310 to 707486 

SeqlDn°2305 LM-4121.1 From 709601 to 709810 

SeqID n° 2306 LM-4122.1 From 731765 to 732001 

SeqlDn°2307 LM-4123.1 From 756079 to 756291 

SeqlDn°2308 LM-4124.1 From 756494 to 756637 

SeqlDn°2309 LM-4130.1 From 890069 to 890215 

SeqlDn°2310 LM-4134.1 From 961203 to 961469 



SeqlDn°2311 LM-4135.1 From 973509 to 973670 

SeqlDn°2312 LM-4137.1 From 981317 to 981544 

SeqlDn°2313 LM-4139.1 From 987274 to 987501 

SeqID n° 2314 LM-4142.1 From 1003593 to 1003829 

SeqID n° 2315 LM-4144.1 From 1057067 to 1057276 



SeqID n° 2316 


LM-4147.1 


From 


1133466 


to 1133696 


SeqID n° 2317 


LM-4148.1 


From 


1145723 


to 1145944 


SeqID n° 2318 
SeqID n° 2319 


LM-4149.2 
LM-415.1 


From 
From 


1154864 
1848166 


to 1156381 
to 1848690 


SeqID n° 2320 


LM-4150.1 


From 


1188014 


to 1188289 


SeqID n° 2321 


LM-4152.2 


From 


1287376 


to 1287591 


SeqID n" 2322 


LM-4153.1 


From 


1331439 


to 1331666 


SeqID n° 2323 
SeqID n° 2324 


LM-4154.1 
LM-4155.1 


From 
From 


1363826 
1366220 


to 1363975 
to 1366432 


SeqID n° 2325 


LM-4157.1 


From 


1385704 


to 1385931 


SeqID n° 2326 


LM-4158.1 


From 


1387014 


to 1387214 


SeqID n° 2327 


LM-4161.1 


From 


1501881 


to 1502054 



Unknwon, similar to PTS 
system galactitol-specific 
enzyme IIC component 
Unknown, similar to 
putative transcription 

regulator 
ribosomal protein L32 
Unknown, similar to 
unknown protein 
Unknown 
unknown 
unknown 
unknown 
Unknown, weakly similar to 
flagellar switch protein 
unknown, LPXTG motif 
protein 
Hypothetical CDS 
Unknown, hypothetical 

protein 
Unknown, similar to ABC 
transporter, ATP-binding 
protein (C-termina! part) 
unknown 
unknown 
Unknown 
D-alanyl carrier protein 
unknown, similar to B. 
subtilis YkzG protein 
unknown, highly similar to 

TN916 0RF8 
unknown, highly similar to 
TN916 0RF19 
unknown 
Unknown, similar to 

unknown protein 
Unknwon, similar to 
carboxysome structural 
protein 
unknown, similar to 
transcriptional regulator 
unknown, similar to B. 
subtilis YnzC protein 
ribosomal protein L33 
unknown similar to B. 

subtilis yqgQ 
Unknown, similar to 
exodeoxyribonuclease 

small subuntt 
Similar to cold shock 
protein 
30S ribosomal protein S21 



WO 01/77335 



PCT/FR01/01118 



152 



SeqID n° 2328 LM-4162.2 From 1576747 to 1577055 
SeqID n° 2329 LM-4166.1 From 1654691 to 1654909 

SeqlDn°2330 LM-417.1 From 1847472 to 1847960 



SeqID n° 2331 
SeqID n° 2332 
SeqID n° 2333 
SeqID n' 2334 
SeqID n B 2335 



LM-4172.1 
LM-4174.1 
LM-4175.1 
LM-4179.1 
LM-418.1 



From 1756629 
From 1764871 
From 1769479 
From 1859050 
From 1846355 



SeqID n° 2336 LM-4180.1 From 1867180 

SeqID n° 2337 LM-4181.1 From 1881041 

SeqID n° 2338 LM-4182.1 From 1890951 

SeqID n° 2339 LM-4183.1 Frx)m 1902281 

SeqID n° 2340 LM-4184.1 From 1928579 

SeqID n° 2341 LM-4186.1 From 1953513 

SeqID n" 2342 LM-4189.1 From 2094877 



SeqID n° 2343 
SeqID n° 2344 
SeqID n° 2345 
SeqID n° 2346 
SeqID n° 2347 
SeqID n° 2348 



SeqID n° 2349 
SeqID n° 2350 



LM-419.1 
LM-4192.1 
LM-4193.1 
LM-4195.1 
LM-4197.1 

LM-420.1 



LM-4200.1 
LM-4201.1 



From 
From 
From 
From 
From 
From 



1845044 
2144874 
2150556 
2184120 
2192379 
1844250 



From 2234357 
From 2242097 



to 1756862 
to 1765077 
to 1769700 
to 1859250 
to 1847479 

to 1867410 

to 1881274 

to 1891139 
to 1902484 
to 1928785 

to 1953713 

to 2095077 

to 1846336 
to 2145065 
to 2150759 
to 2184347 
to 2192609 
to 1844963 



to 2234533 
to 2242282 



SeqID n° 2351 LM-4203.1 From 2293497 to 2293685 



SeqID n° 2352 
SeqID n' 2353 
SeqID n° 2354 



LM-4206.1 
LM-4207.1 
LM-4208.1 



From 2346593 
From 2361582 
From 2366288 



SeqID n° 2355 LM-4209.1 From. 2387499 
SeqID n° 2356 LM-421 .1 From 1 843993 



SeqID n° 2357 
SeqID n° 2358 
SeqID n° 2359 
SeqID n° 2360 
SeqID n° 2361 
SeqID n° 2362 



LM-4210.1 
LM-421 1.1 
LM-4212.1 
LM-4213.1 
LM-421 4.1 
LM-421 5.1 



From 
From 
From 
From 
From 
From 



2388528 
2389208 
2389491 
2391020 
2394790 
2395586 



to 2346784 
to 2361791 
to 2366446 

to 2387663 

to 1844238 

to 2388710 
to 2389387 
to 2389658 
to 2391217 
to 2394984 
to 2395801 



ribosomal protein L21 
Unknown, hypothetical 
gene 

hosphoribosylaminoimidaz 
ole carboxylase I 
unknown 
unknown 
unknown 
ribosomal protein L35 
phosphoribosylaminoimida 
zole carboxylase II 
Unknown, similar to 
unknown protein 
unknown, highly similar to 
acyl earner proteins 
ribosomal protein L28 

unknown 
unknown, similar to 
putative mercuric ion 

binding proteins 
similar to cold shock 
protein 
similar to major cold-shock 
protein 
adenylosuccinate lyase 
unknown 
unknown 
Unknown 
Unknown 
phosphoribosylaminoimida 
zole succinocarboxamide 
synthetase 
unknown 
unknown, similar to B. 
subtilis YwmG protein 
Unknown, similar to 
unknown protein 
unknown, 
unknwon 
protein gp22 
[Bacteriophage A1 18] 
Bacteriophage A1 18 gp65 
protein 
Unknown, similar to 

unknown protein 
Hypothetical protein 
Unknown 
unknown 
Unknown 
Unknown 
gp44 [Bacteriophage A1 18] 
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SeqlDn°2363 LM-4216.1 From 2398287 to 2398529 
SeqlDn°2364 LM-422.1 From 1843306 to 1843989 

SeqlDn°2365 LM-4225.1 From 2467388 to 2467621 



SeqID n" 2366 
SeqID n° 2367 

SeqID n° 2368 
SeqID n° 2369 

SeqID n° 2370 

SeqID n° 2371 

SeqID n° 2372 

SeqID n° 2373 
SeqID n° 2374 

SeqID n° 2375 
SeqID n° 2376 

SeqID n° 2377 

SeqID n° 2378 

SeqID n° 2379 

SeqID n° 2380 



LM-4226.1 
LM-4227.1 

LM-4228.1 
LM-423.1 

LM-4230.1 

LM-4231.1 

LM-4233.1 

LM-4234.1 
LM-4235.1 

LM-4236.1 
LM-4246.1 

LM-4247.1 

LM-4248.1 

LM-4249.1 

LM-425.1 



From 2468545 
From 2480325 

From 2501306 
From 1841094 

From 2559617 

From 2567696 

From 261 11 92 

From 2624633 
From 2643753 

From 2646464 
From 145354 

From 145085 

From 136714 

From 136469 

From 1839682 



SeqID n e 2381 LM-4251.1 From 77863 

SeqID n° 2382 LM-4253.1 From 52373 

SeqID n° 2383 LM-4254.1 From 50514 

SeqID n° 2384 LM-426.1 From 1838614 



to 2468703 
to 2480528 

to 2501470 
to 1843313 

to 2559817 

to 2567935 

to 261 1410 

to 2624878 
to 2643938 

to 2646664 
to 145560 

to 145171 

to 136992 

to 136702 

to 1841109 



to 78066 
to 52534 
to 50753 
to 1839663 



SeqID n° 2385 LM-4262.2 From 267909 to 268949 

SeqID n° 2386 LM-4267.1 From 1153010 to 1153783 

SeqID n° 2387 LM-4268.1 From 1153848 to 1154204 

SeqID n° 2388 LM-427.1 From 1838051 to 1838617 



SeqID n° 2389 LM-4276.2 From 2943569 to 2943703 
SeqID n° 2390 LM-4277.1 From 2693947 to 2694060 



Unknown, similar to 
transcription regulator 
Unknown, similar to 
phosphoribosylformylglycin 
amidine synthetase II 
Unknown, similar to B. 
subtilis YuzB protein 

Unknown 
Unknown, similar to 
repressor protein 
Unknown 
phosphoribosylformylglycin 
amidine synthetase I 
Unknown, similar to B. 
subtilis yvIC protein 
* Unknown, similar to B. 

subtilis CsbA protein 
unknown, highly similar to 
H+-transporting ATP 

synthase chain c 
ribosomal protein L31 
Unknwon, similar to 4- 
oxalocrotonate isomerase 
unknown 
Unknwon, hypothetical 

protein 
Unknwon, hypothetical 
protein 
Unknwon, similar to E. coli 

YjdJ protein 
Unknown, similar to E. coli 
Yjdl protein 
glutamine 
phosphoribosylpyrophosph 
ate amidotransferase 
Unknown, Hypothetical 
Unknown 
30S ribosomal protein S18 
phosphoribosylaminoimida 
zole synthetase 
Unknown, similar to 
penicillinase antirepressor 
unknown, similar to 
regulatory proteins 
unknown 
unknown, highly similar to 
phosphoribosylglycinamide 
formyltransferases 
ribosomal protein L34 
ribosomal protein L36 
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SeqID n° 2391 LM-428.1 From 1836516 to 1838045 



SeqID n° 2392 LM-429. 1 From 1 835229 to 1 836491 

SeqID n° 2393 LM-4293.1 From 2130228 to 2130401 

SeqID n° 2394 LM-4295. 1 From 2052696 to 2052833 

SeqID n° 2395 LM-43.1 From 2740052 to 2740333 



SeqID n° 2396 LM-430.1 From 1834795 to 1835091 

SeqID n° 2397 LM-432.1 From 1834523 to 1834777 
SeqID n° 2398 LM-433.1 From 1833125 to 1834480 



SeqID n° 2399 LM-4342.1 From 2317644 to 2317841 

SeqID n° 2400 LM-4349.1 From 1287710 to 1288243 

SeqID n° 2401 LM-435.1 From 1832242 to 1832919 

SeqID n° 2402 LM-4352.1 From 436161 to 436352 

SeqID n° 2403 LM-4353.1 From 2250846 to 2251004 

SeqID n 6 2404 LM-4358.1 From 2491285 to 2491473 

SeqID n° 2405 LM-4359,1 From 2173635 to 2173832 

SeqID n° 2406 LM-436.2 From 1 829982 to 1 8321 77 

SeqID n° 2407 LM-4360.1 From 263366 to 26351 5 

SeqID n° 2408 LM-438. 1 From 1 827941 to 1 829956 

SeqID n° 2409 LM-439. 1 From 1826829 to 1 827944 

SeqID n° 2410 LM-44.1 From 2740390 to 2740854 



SeqID n° 2411 LM-440.1 From 1826411 to 1826704 

SeqID n° 2412 LM-441 .1 From 1824936 to 1826387 

SeqID n° 241 3 LM-442. 1 From 1 823494 to 1 824924 

SeqID n° 2414 LM-443.2 From 1822421 to 1823353 

SeqID n° 2415 LM-444.2 From 1821518 to 1822273 



Bifunctional 
phosphoribosylaminoimida 
zole carboxy formyl 
formyltransferase and 
inosine-monophosphate 
cyclohydrolase 

phosphoribosylglycinamide 
synthetase 
ribosomal protein L32 
Unknown 
Unknwon, similar to PTS 
system galactitol-specific 
enzyme IIB component 
Unknown, similar to 
unknown protein 
Unknown 
Unknown, similar to 
putative sodium-dependent 
transporter 
Unknown, similar to 
unknown protein 
Unknown 
..Unknown, similar to 
unknown protein 
Unknown 
Unknown 
unknown 

ATP-dependent DNA 
heiicase 

Unknown, similar to DNA 
ligase 
Unknown, similar to 
unknown protein 
Unknwon, similar to PTS 
system galactitol-specific 
enzyme HA component 
glutamyl-tRNA(Gln) 
amidotransferase (subunit 
C) 

glutamyl-tRNA(Gln) 
amidotransferase (subunit 
A) 

glutamyl-tRNA(Gln) 
amidotransferase (subunit 
B) 

Unknown, similar to 
unknown protein 
Unknown 
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SeqID n° 2416 LM-445.1 From 1820133 to 1821494 



SeqIDn 0 2417 
SeqID n° 2418 
SeqIDn 0 2419 



LM-446. 1 From 1 81 9244 to 1 81 9798 
LM-447.2 From 1818740 to 1819213 
LM-45.2 From 2740888 to 2742957 



SeqID n° 2420 LM-451 . 1 From 2435023 to 2436141 



SeqID n° 2421 
SeqID n° 2422 



LM-452. 1 From 2434627 to 2434866 
LM-453. 1 From 2433224 to 2434618 



SeqID n° 2423 LM-455. 1 From 2431688 to 243321 1 



SeqIDn 0 2424 
SeqID n° 2425 
SeqID n° 2426 
SeqID n° 2427 



LM-456. 1 From 2430826 to 2431296 

LM-458. 1 From 242801 0 to 2430793 

LM-460. 1 From 2427045 to 2427890 

LM-463. 1 From 2426270 to 2427001 



SeqID n° 2428 LM-464. 1 From 2425723 to 2426253 



SeqID n° 2429 
SeqID n° 2430 



LM-466. 1 From 2424922 to 2425533 
LM-468. 1 From 2423480 to 2424721 



SeqID n° 2431 LM-469. 1 From 2421878 to 2423476 



SeqID n° 2432 
SeqID n° 2433 



LM-47.2 From 2743326 to 27441 38 
LM-471 . 1 From 241 9797 to 2421 749 



SeqID n° 2434 LM-472.1 From 2418871 to 2419770 



SeqID n° 2435 
SeqID n° 2436 
SeqID n° 2437 



LM-473. 1 From 241 8083 to 241 8628 
LM-475.1 From 2417531 to 2418067 
LM-476.1 From 2416705 to 2417514 



SeqID n° 2438 LM-477.1 From 2415967 to 2416683 



Unknown, similar to 
hypothetical RNA 
methyltransferase 
Unknown, similar to 

unknown protein 
Unknown, similar to 
shikimate kinase 
unknown, similar to 
transcriptional 
antiterminator (BgIG family) 

Unknown, similar to S. 
pyogenes RofA regulatory 
protein 
Hypothetical protein 
Unknown, similar to 
glutamate decarboxylase 
Unknown, similar to amino 
acid antiporter (acid 

resistance) 
Unknown, conserved 
hypothetical protein 
Unknown, transmembrane 
protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to N- 
acetylglucosamine-6- 
phosphate isomerase 
Unknown, similar to 
unknown protein 
Unknown 
Unknown; Similar to 
multidrug resistance 

protein 
Unknown, conserved 
hypothetical protein 

Unknown 
Unknown, similar to 
putative Na+/H+ antiporter 
Unknown, similar to LysR 
family transcription 
regulator 
Unknown, similar to NADH- 
dependent FMN reductase 
Unknown, similar to B. 
subtilis Ytml protein 
Unknown, similar to amino 
acid ABC transporter 
(binding protein) 
Unknown, similar to amino 
acid ABC-transporter 
(permease) 
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SeqID n° 2439 LM-478. 1 From 241 5246 to 241 5953 



SeqID n° 2440 
SeqID n° 2441 

SeqID n° 2442 
SeqID n° 2443 
SeqID n° 2444 



LM-48.1 From 2744212 to 2744574 

LM-480. 1 From 2414470 to 241 5249 

LM-481.1 From 2413478 to 2414473 

LM-482.1 From 2413219 to 2413485 

LM-483.1 From 241 1900 to 2413222 



SeqID n° 2445 LM-484.1 From 241 1 126 to 241 1827 



SeqID n° 2446 
SeqID n° 2447 



LM-485. 1 From 2409875 to 241 0993 
LM-486. 1 From 2408967 to 2409878 



SeqID n° 2448 LM-487. 1 From 2408450 to 2408893 



SeqID n° 2449 
SeqID n° 2450 
SeqID n°2451 



SeqID n° 2452 
SeqID n° 2453 

SeqID n° 2454 
SeqID n° 2455 
SeqID n° 2456 
SeqID n° 2457 

SeqID n° 2458 
SeqID n° 2459 



LM-488.1 From 2407128 to 2408453 
LM-49.1 From 2744571 to 2744939 
LM-490.1 From 2406161 to 2406913 



LM-491 . 1 From 2405241 to 2406164 

LM-492. 1 From 2403341 to 2405239 

LM-494. 1 From 2402888 to 2403235 

LM-495.1 From 2402357 to 2402833 

LM-496.1 From 2401 008 to 2402366 

LM-497. 1 From 2400264 to 2400944 

LM-498. 1 From 2399542 to 2400243 

LM-5. 1 From 271 3962 to 271 4942 



SeqID n° 2460 LM-500.1 From 2398685 to 2399161 



Unknown, similar to amino 
acid ABC transporter 

(permease) 
Unknown, conserved 
hypothetical protein 
Unknown, similar to amino 
acid ABC-transporter, ATP- 
binding protein 
Unknown, conserved 
hypothetical protein 
Unknown, similar to B. 
subtilis Ytnl protein 
Unknown, similar to 
nitrilotriacetate 
monooxygenase 
Unknwon, similar to 16S 
pseudouridylate synthase 
Similar to kinases 
Unknown, similar to 
Erwinia chrysanthemi IndA 
protein 
Unknown, conserved 
hypothetical protein 
aminopeptidase C 

Unknown 
Unknown, similar to 
regulatory protein DeoR 

family 
fructose- 1 -phosphate 
kinase 

unknown, highly similar to 
phosphotransferase 
system (PTS) fructose- 
specific enzyme IIABC 
component 
Unknown, similar to 
transcriptional regulator 
competence transcription 
factor 
putative integrase 
[Bacteriophage A118] 
Unknown, weakly similar to 
gp32_Bacteriophage A118 

protein 
Unknown, similar to protein 
gp33 [Bacteriophage A118] 
Unknown, similar to 
heptaprenyl diphosphate 
synthase component II 
Unknown, similar to a 
putative repressor protein 
[Bacteriophage A1 18] 
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SeqID n° 2461 
SeqID n° 2462 



LM-502. 1 From 2397800 to 2398084 
LM-503. 1 From 2397493 to 2397774 



SeqID n° 2463 LM-506. 1 From 2396454 to 2397230 



SeqID n° 2464 
SeqID n° 2465 



LM-507, 1 From 2395798 to 2396331 
LM-509. 1 From 2394317 to 2394793 



SeqID n° 2466 LM-51.1 From 2744985 to 2745791 



SeqID n° 2467 
SeqID n° 2468 



LM-510.1 From 2393613 to 239431 1 
LM-51 2. 1 From 2392622 to 2393596 



SeqID n° 2469 LM-513.1 From 2391813 to 2392625 



SeqID n° 2470 

SeqID n° 2471 

SeqID n° 2472 
SeqID n° 2473 

SeqID n° 2474 
SeqID n° 2475 



SeqID n° 2476 
SeqID n° 2477 
SeqID n° 2478 

SeqID n° 2479 
SeqID n° 2480 



SeqID n° 2481 
SeqID n° 2482 

SeqID n° 2483 
SeqID n° 2484 
SeqID n° 2485 



LM-51 4.1 From 2391220 to 2391816 

LM-517.1 From 2390580 to 2391023 

LM-51 8. 1 From 23901 1 3 to 2390583 

LM-52. 1 From 2745907 to 2746377 

LM-520. 1 From 2389655 to 23901 16 

LM-522. 1 From 2388729 to 238921 1 



LM-523.1 From 2388179 to 2388583 

LM-524. 1 From 2387792 to 23881 75 

LM-525. 1 From 2387046 to 2387480 

LM-527. 1 From 2386141 to 2386680 

LM-529. 1 From 2385301 to 2386095 



LM-53.1 From 2746423 to 2746878 

LM-530. 1 From 2384001 to 2385332 

LM-531 .1 From 2382228 to 2383988 

LM-532. 1 From 2381088 to 2382227 

LM-533.1 From 2380419 to 2381009 



SeqID n° 2486 LM-535.1 From 2379418 to 2380419 



Unknown 
Unknown, similar to protein 
gp41 [Bacteriophage A1 18] 
Unknown, similar to anti- 
repressor [Bacteriophage 
A118] 

gp43 [Bacteriophage A118] 
Unknown, similar to 
bacteriophage proteins 
Unknown, weakly similar to 
AraC-like transcription 
regulator 
Unknown 
Unknown, similar to protein 
gp49 [Bacteriophage A1 18] 
Unknown, similar to site- 
specific DNA- 
methyltransferase 
Unknown, similar to protein 
gp51 [Bacteriophage A1 18] 
Unknown, similar to a 
bacteriophage protein 

Unknown 
Unknown, conserved 
hypothetical protein 
Unknown 
unknown, similar to single- 
stranded DNA-binding 
protein 
Unknown 
unknown 
Protein gp66 
[Bacteriophage A1 18] 

Unknown 
Unknown, similar to 
putative terminase small 
subunit from Bacteriophage 
A118 

Unknown, similar to ribose 
5-phosphate epimerase 
putative terminase large 

subunit from Bacteriophage 
A118 

putative portal protein 
[Bacteriophage A118] 
Protein gp4 [Bacteriophage 
A118] 
Unknown, putative 
. scaffolding protein 
[Bacteriophage A 118] 
Unknown, similar to coat 
protein [Bacteriophage 
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SeqID n° 2487 


LM-536.1 


From 2379004 


to 2379399 


SeqID n° 2488 


LM-537.1 


From 2378642 


to 2379004 


SeqID n° 2489 


LM-538.1 


From 2378304 


to 2378642 


SeqID n° 2490 


LM-539.1 


From 2377897 


to 2378304 


SeqID n° 2491 
SeqID n° 2492 


LM-54.1 
LM-540.1 


From 2747073 
From 2377460 


to 2747414 
to 2377894 


SeqID n° 2493 


LM-541.1 


From 2377198 


to 2377530 


SeqID n° 2494 


LM-542.1 


From 2376721 


to 2377143 


SeqID n° 2495 


LM-543.1 


From 2376110 


to 2376715 


SeqID n° 2496 


LM-548.1 


From 2370736 


to 2376099 


Qonin n° OAQ7 


LM-549. 1 


From 2369916 


to 2370734 


Qonin n° 04QQ 


LM-OD.1 


From 2747427 


to 2748683 


SeqID n° 2499 


LM-550.1 


From 2368882 


to 2369907 


SeqID n° 2500 


LM-552.1 


From 2367853 


to 2368881 


oeqiu n zou i 


1 IV/! CCO «1 


From 2366780 


to 2367853 


SeqID n° 2502 


LM-554.1 


From 2366451 


to 2366768 


SeqID n° 2503 


LM-555.1 


From 2365893 


to 2366258 


SeqID n° 2504 
oeqiu n zovo 


LM-557.1 
LM-ooo.1 


From 2365599 
From 2364754 


to 2365880 
to 2365599 


SeqID n° 2506 
SeqID n° 2507 


LM-559.1 
LM-56.1 


From 2363709 
From 2748685 


to 2364260 
to 2749497 


SeqID n° 2508 


LM-560.1 


From 2363135 


to 2363632 


SeqID n° 2509 


LM-562.3 


From 2362661 


to 23631 10 


SeqID n° 2510 


LM-563.3 


From 318 


to 1673 


SeqID n° 2511 


LM-564.1 


From 1867 


to 3012 



SPP1] 

Protein gp8 [Bacteriophage 
A118] 

Protein gp9 [Bacteriophage 
A118] 
Protein gp10 
[Bacteriophage A118] 

Portein gp11 
[Bacteriophage A118] 

Unknown 
major tail shaft protein 
[Bacteriophage A118] 

Portein gp13 
[Bacteriophage A1 18] 

Protein gp14 
[Bacteriophage A1 18] 

Protein gp15 
[Bacteriophage A118] 
Unknown, putative tape- 
measure [Bacteriophage 
A118] 
Protein gp17 
[Bacteriophage A1 18] 
Unknown, similar to UV- 
damage repair protein 

Protein gp18 
[Bacteriophage A1 18] 

Protein gp19 
[Bacteriophage A118] 

protein gp20 
[Bacteriophage A118] 

protein gp21 
[Bacteriophage A118] 

protein gp23 
[Bacteriophage A118] 
holin [Bacteriophage A1 18] 
L-alanoyl-D-glutamate 
peptidase 
Unknown 
Unknown, similar to 
hydrolase (esterase) 
Unknown, similar to an 
unknown bacteriophage 
protein 
Portein gp28 
[Bacteriophage A118] 
Chromosomal replication 

initiation protein DnaA 
DNA polymerase III, beta 
chain 
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SeqID n° 2512 


LM-565.1 


From 3121 


to 4464 


SeqID n° 2513 


LM-566.1 


From 4644 


to 4865 


SeqID n° 2514 
SeqID n° 2515 


LM-567.1 
LM-57.1 


From 4869 
From 2749538 


to 5981 
to 275023; 


SealD n° 2516 
SeqID n° 2517 
SeqID n° 2518 


LM-570 1 
LM-573.1 
LM-575.1 


From RO^O 

PI Will UwU 

From 8065 
From 10728 


in 7Q70 

to 10593 
to 12242 


SeqID n° 2519 


LM-576.1 


From 12258 


to 12776 


SeqID n° 2520 


LM-578.1 


From 12918 


to 13886 


SeqID n° 2521 


LM-580.1 


From 13843 


to 14814 


SeqID n° 2522 


LM-581.1 


From 14795 


to 15874 


SeqID n° 2523 


LM-582.1 


From 16219 


to 17325 


SeqID n° 2524 


LM-584.1 


From 17344 


to 19323 


SeqID n° 2525 


LM-585.1 


From 19311 


to 19922 


SeqID n° 2526 


LM-586.1 


From 19924 


to 20256 


SeqID n° 2527 


LM-587.1 


From 20308 


to 21426 


SeqID n° 2528 
SeqID n° 2529 
SeqID n" 2530 


LM-589.1 
LM-591.1 
LM-592.1 


From 21652 
From 23132 
From 24194 


to 23085 
to 23953 
to 24934 


SeqID n° 2531 


LM-593.1 


From 24950 


to 25351 


SeqID n° 2532 


LM-594.1 


From 25351 


to 25839 


SeqID n° 2533 


LM-596.1 


From 25862 


to 26665 


SeqID n° 2534 


LM-597.1 


From 26640 


to 27467 



Unknown, conserved 
hypothetical protein 
Unknown, similar to B. 
subtilis YaaA protein 
RecF protein 
Unknown, similar to two 
components response 

regulator 
DNA gyrase subunit B 
DNA gyrase subunit A 
Unknown, similar to 
cardiolipin synthase 

diamine N- 
acetyltransferase 
Unknown, similar to 
mevalonate kinase 
unknown, similar to 
mevalonate diphosphate 
decarboxylase 
Unknown, similar to 
mevalonate kinases 
AA3-600 quinol oxidase 

subunit II 
AA3-600 quinol oxidase 

subunit I 
AA3-600 quinol oxidase 

subunit III 
unknown, highly similar to 
quinol oxidase aa3-600 
chain IV 
Unknown, similar to 
Bacil(jjs anthracis CapA 
protein (polyglutamate 
capsule biosynthesis) 
beta-glucosidase 
Unknown 
Unknown, similar to 
transcriptional regulator 

(GntR family) 
Unknown, similar to PTS 
system, fnjctose-specific 

HA component 
Unknown, similar to PTS 
system, fructose-specific 

IIB component 
Unknown, similar to PTS 
system, fructose-specific 

IIC component 
Unknown, similar to PTS 
system, mannose-specific 
IID component 
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SealD n° 2535 

UwVi|l u-f I | fawVv 


LM-598 1 


From ?74Q5 


to 9ft 1QQ 


univnown, similar to 
phosphoheptose 
isomerasp. 


SeqID n° 2536 


LM-599.1 


From 28254 


to 28895 


Unknown similar to P mli 

Vl IV/VVi 1, 1 HIGH IU Lv, VA/II 

lAjfjpci i \\Ji i ICUoldolo 

orotein CutC 


SeqID n° 2537 


LM-6.1 


From 2715023 


to 2716363 


Unknown 


SeqID n° 2538 


LM-600.1 


From 29100 


to 31004 


Unknown, similar to PTS 
svstem. beta-alucosides 
specific IIA'BC component 


SeqID n° 2539 


LM-601.1 


From 31088 


to 31963 


Unknown similar to F coli 
microcin C7 self-immunity 


SeqID n° 2540 








protein (MccF) 


LM-602.1 


From 32197 


to 32535 


Unknown 


SeqID n° 2541 


LM-604.1 


From 32571 


to 33380 


Unknown, conserved 


SeqID n° 2542 








hypothetical orotein 


LM-605.1 


From 33397 


to 34452 


Unknown, transcriptional 
regulator Lad family 


SeqID n° 2543 


LM-606.1 


From 34654 


to 35619 


Unknown, similar to xylose 
re|jieb5or 


SeqID n° 2544 


t_lvl-OUO. I 


Prom ^Rftift 

rrom ooo io 


10 ooUlo 


Unknown, similar to 
endoglucanase 


SeqID n° 2545 


LM-609.1 


From 38031 


to 39383 


Unknown, similar to PTS 
sysiem, ceuoDiose-speciric 
IIC component 




I M-R10 1 

l_IVI~U 1 \J. I 


Frnm ^ty^ftR 
riLiiii o^ooj 




unknown, similar to 
GIucosamme--fructose-6- 

pi tuofji idle 
dlllll luiidl lolGfdot* 


SeqID n° 2547 








terminal domain) 


LM-6111 


From 40705 


to 41730 


Unknown similar to 

ornithine 
carbamoyltransferase 


SeqID n° 2548 


LM-613.1 


From 41803 


to 431 88 


Unknown similar to amino 
acid transporter 


SeqID n" 2549 


LM-614.1 


From 43175 


to 44266 


Unknown pnn^prvpri 

w i irvi iuvvi i, uui too vcu 

hypothetical protein 


SeqID n° 2550 


LM-615.1 


From 44282 


to 45223 


parhprnAfp IfinaQP 
^aiuaiiiaLC mi laoc 


SeqID n° 2551 


LM-616.1 


From 45325 


to 46434 


Unknown, conserved 
hypothetical protein 


SeqID n° 2552 


LM-617.1 


From 46451 


to 47230 


Unknown enn^prvpri 

Ui irvl iuvvi i, loci vc?U 

hvoothetical orotein 

1 IJf |^wM IVrllwCll fJ* VlVll 1, 

hypothetical regulator 


SeqID n° 2553 


LM-618.1 


From 47335 


to 47994 


Unknown, similar to E. coli 
DedA protein 


SeqID n°2554 


LM-619.2 


From 48074 


to 49306 


Unknown, similar to 
arginine deiminase 


SeqID n° 2555 


LM-62.1 


From 2750230 


to 2752920 


Unknown, similar to the two 
components sensor protein 
kdpD 


SeqID n° 2556 


LM-620.2 


From 703688 


to 703951 


Unknown 
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SeqID n° 


2557 


LM-621.3 


SeqID n° 


2558 


LM-622.1 


SeqID n° 


2559 


LM-623.1 


SeqID n° 


2560 


LM-624.1 


SeqID n° 


2561 


LM-625.1 


SeqID n° 
SeqID n° 


2562 
2563 


LM-626.1 
LM-627.1 


SealD n° 
SeqID n° 
SeqID n° 


2564 
2565 
2566 


LM-628 1 
LM-629.1 
LM-631.1 


SeqID n" 


2567 


LM-632.1 


SeqID n° 


2568 


LM-633.1 


SeqID n° 


2569 


LM-635.1 


SeqID n° 


2570 


LM-636.1 


SeqID n° 


2571 


LM-637.1 


SeqID n° 


2572 


LM-639.1 


SeqID n° 


2573 


LM-641.1 


SeqID n° 
SeqID n" 


2574 
2575 


LM-642.1 
LM-643.1 


SeqID n° 


2576 


LM-645.1 


SeqID n° 
SeqID n° 


2577 
2578 


LM-647.1 
LM-648.1 





161 




From 


703128 


to 703691 


From 


703961 


to 704338 


From 


704452 


to 705387 


From 


705380 


to 706150 


From 


706408 


to 707292 


From 
From 


707612 
709134 


to 708220 
to 709562 


From 
From 
From 


709829 
711121 
711428 


to 710749 
to 711435 
to 712195 


From 


712208 


to 712480 


From 


712483 


to 713244 


From 


713260 


to 714306 


From 


714353 


to 716428 


From 


716450 


to 717673 


From 


717670 


to 718449 


From 


718478 


to 719266 


FrQm 
From 


719291 
719653 


to 719626 
to 720504 


From 


720464 


to 721291 


From 
From 


721301 
721824 


to 721801 
to 723737 
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Unknown, similar to acetyl 
transferase 
Unknown, similar to 
unknown protein 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, conserved 
membrane protein 
Unknown, similar to 
oxidoreductase 
Unknown 
Unknown, similar to 
unknown protein 
Unknown 
Unknown 
Unknown, similar to 
flagellar biosynthesic 

protein FliP 
Unknown, similar to 
flagellar biosynthesis 

protein FliQ 
Unknown, similar to 
flagellar biosynthetic 

protein FliR 
Unknown, similar to 
flagellar biosynthetic 

protein flhB 
Unknown, similar to 
flagella-associated protein 
flhA 

Unknown, similar to 
flagellar biosynthesis 

protein FlhF 
Unknown, similar to 
flagellar hook-basal body 
protein FlgG 
Unknown, similar to 
chemotactic 
methyltransferase CheR 
Unknown 
Unknown, similar to motility 
protein (flagellar motor 
rotation) MotA 
Unknown, similar to motility 
protein (flagellar motor 
rotation) MotB 
Unknown 
Unknown, similar to 
unknown protein 
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SeqID n° 2579 LM-649. 1 From 723750 to 724658 

SeqID n° 2580 LM-65.1 From 2752936 to 2753508 

SeqID n° 2581 LM-650.3 From 724896 to 725759 

SeqID n° 2582 LM-651 A From 726034 to 726393 

SeqID n° 2583 LM-652.1 From 726413 to 728269 

SeqID n° 2584 LM-653. 1 From 728282 to 728581 



SeqID n° 2585 LM-654.1 From 728600 to 729010 
SeqID n° 2586 LM-657. 1 From 729026 to 730072 
SeqID n° 2587 LM-658.1 From 730074 to 730496 



SeqID n° 2588 LM-659.1 From 730516 to 731751 
SeqID n° 2589 LM-660.1 From 732022 to 733014 

SeqID n° 2590 LM-664. 1 From 73301 7 to 734564 



SeqID n° 2591 
SeqID n° 2592 
SeqID n° 2593 
SeqID n° 2594 
SeqID n° 2595 



LM-665.1 
LM-668.1 
LM-669.1 
LM-670.1 
LM-671.1 



From 734570 
From 735996 
From 737269 
From 737743 
From 738191 



SeqID n° 2596 LM-672. 1 From 739723 



SeqID n° 2597 LM-674.1 From 740610 



SeqID n° 2598 LM-675.1 From 741918 



SeqID n° 2599 LM-676. 1 From 742276 
SeqID n° 2600 LM-677.1 From 742578 



SeqID n° 2601 LM-679. 1 From 742991 



SeqID n° 2602 LM-680.1 From 743418 



to 735973 
to 737162 
to 737733 
to 738171 
to 739711 



to 740598 
to 741899 
to 742304 



to 742557 
to 742979 



to 743401 
to 743714 



Unknown, similar to CheA 

activity-modulating 
chemotaxis protein CheV 

potassium-transporting 
atpase c chain 
flagellin protein 
Chemotaxis response 

regulator CheY 
two-component sensor 
histidine kinase CheA 
Unknown, similar to 
flagellar motor switch 
protein fliY C-terminal part 
Unknown 
Unknown 
unknown, similar to 
flagellar hook assembly 
protein 
Unknown, similar to 
flagellar hook protein FlgE 

Unknown, similar to 
flagellar switch protein FliM 

UNknown, similar to 
flagellar motor switch 
protein fliY 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown, similar to 
flagellar hook-associated 
protein FlgK 
Unknown, similar to 
flagellar hook-associated 
protein 3 FlgL 
Unknown, similar to 
flagellar hook-associated 
protein 2 FliD 
Unknown, similar to 
hypothetical flagellar 
protein 
Unknown 
Unknown, similar to 
flagellar basal-body rod 
protein flgB 
Unknown, similar to 
flagellar basal-body rod 
protein flgC 
Unknown, similar to 
flagellar hook-basal body 
complex protein FliE 
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SeqlD n° 2603 


LM-682.1 


From 743782 


to 745434 


SeqlD n° 2604 


LM-685.1 


From 745437 


to 746543 


SeqlD n° 2605 
SeqlD n° 2606 


LM-686.1 
LM-688.1 


From 746530 
From 747219 


to 747222 
to 748520 


SeqlD n° 2607 


LM-689.1 


From 748537 


to 749205 


SeqlD n" 2608 


LM-69.1 


From 2753523 


to 2755568 


SealD n° 2609 
SeqlD n° 2610 


LM-690 1 
LM-691.1 


From 749219 
From 749985 


to 74QRR^ 

to 750311 


SeqlD n° 2611 
SeqlD n° 2612 


LM-692.1 
LM-693.1 


From 750311 
From 750680 


to 750634 
to 751327 


SeqlD n°2613 


LM-695.1 


From 751 598 


to 753328 


SealD n" 2R14 


I M-RQ7 1 

LI VI* w 27 f . I 


r rum /oo*tyu 


lO 


SeqlD n° 2615 


LM-698 1 


From 755308 


to 756036 


SeqlD n° 2616 


LM-699.2 


From 2842413 


to 2843867 


SeqlD n° 2617 
SeqlD n° 2618 


LM-7.1 
LM-700.1 


From 2716408 
From 2843914 


to 2717394 
to 2844216 


SeqlD n° 2619 


LM-702.1 


From 2844249 


to 2845601 


SeqlD n° 2620 


LM-703.1 


From 2845613 


to 2846497 


SeqlD n° 2621 


LM-704.1 


From 2846490 


to 2846798 


SeqlD n° 2622 


LM-705.1 


From 2846839 


to 2847573 


SeqlD n° 2623 
SeqlD n° 2624 
SeqlD n° 2625 


LM-706.1 
LM-707.2 
LM-708.3 


From 2847673 
From 2848406 
From 2849563 


to 2848335 
to 2849536 
to 2850450 



Unknown, similar to 
flagellar basal-body M-ring 
protein fliF 
Unknown, similar to 
flagellar motor switch 
protein fliG 
Unknown 
Unknwon, similar to H+- 
transporting ATP synthase 
alpha chain Flil, flagellar- 
specific, - 
Unknown, similar to 
transglycosylase 
potassium-transporting 
atpase b chain 
Unknown 
Unknown, similar to 
unknown protein 
Unknown 
putative fibronectin-binding 
protein 
Unknown, similar to 
pyruvate oxidase 
Unknown, similar to metyl- 
accepting chemotaxis 

protein 
Unknown, similar to B. 
subtilis YvpB protein 
Unknown, similar to beta- 
glucosidase 
Unknown 
Unknown, similar to PTS 
cellobiose-specific enzyme 
IIB 

Unknown, similar to PTS 
cellobiose-specific enzyme 

IIC component 
Unknown, similar to xylose 
operon regulatory protein 

and to glucose kinase 
Unknown, similar to PTS 
cellobiose-specific enzyme 
HA 

Unknown, similar to 
hypothetical transcriptional 
regulator 
Unknwon 
Unknwon 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
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SeqID n° 2626 LM-709.2 From 2850620 to 2852950 



SeqID n° 2627 LM-710.1 From 2852988 to 2854436 

SeqID n° 2628 LM-711.2 From 2854429 to 2856282 

SeqID n° 2629 LM-712.2 From 2856379 to 2857218 

SeqID n° 2630 LM-714.1 From 2857580 to 2858203 

SeqID n° 2631 LM-715.1 From 2858196 to 2860364 

SeqID n° 2632 LM-716.1 From 2860426 to 2860821 

SeqID n° 2633 LM-717.1 From 2861424 to 2862614 

SeqID n° 2634 LM-718.1 From 2862774 to 2863283 

SeqID n° 2635 LM-720. 1 From 2863561 to 2864661 

SeqID n° 2636 LM-721.1 From 2864813 to 2865121 

SeqID n° 2637 LM-723. 1 From 2865127 to 2867397 

SeqID n° 2638 LM-724.1 From 2867432 to 2867731 

SeqID n° 2639 LM-725. 1 From 2867746 to 2869041 

SeqID n° 2640 LM-726.1 From 2869191 to 2871107 



SeqID n° 2641 LM-727.1 From 2871318 to 2872784 
SeqID n° 2642 LM-728.1 From 2872932 to 2873915 
SeqID n° 2643 LM-73. 1 From 2755580 to 2757265 



SeqID n° 2644 LM-730. 1 From 2873915 to 2875837 



SeqID n° 2645 LM-731 .2 From 2875945 to 2876775 
SeqID n° 2646 LM-733.2 From 2877160 to 287801 1 

SeqID n° 2647 LM-734. 1 From 2878004 to 2878765 



Unknown, similar to 
gamma-glutamylcysteine 

synthetase (for the 
isMerminal part) and to 
cyanophycin synthetase 

(C-terminal part) 
Unknown, similar to beta- 

glucosidase 
Unknown, similar to beta- 
glucoside-specific enzyme 
IIABC 
Unknwon, similar to 
transcription antiterminator 
Unknwon, similar to ABC 
transporter, ATP-binding 
protein 
Unknown 
Unknown 
Unknown, similar to efflux 
protein 
Unknown 
Unknown, similar to 
probable GTP-binding 

protein 
Unknown, similar to 
cellobiose PTS enzyme HA 
Unknown, similar to beta- 

glucosidase 
Unknown, similar to PTS, 
cellobiose-specific IIB 

component 
Unknown, similar to 

cellobiose 
phosphotransferase 
system enzyme IIC 
Unknown, similar to 
lichenan operon 
transcription antiterminator 
licR 
cataiase 
Unknown 
unknown, highly similar to 
potassium-transporting 

atpase a chain 
beta-glucoside-specific 
phosphotransferase 
enzyme II 
transcription antiterminator 
Partition protein ParB 

homolg 
Partition protein, ParA 
homolog 
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SeqID n° 2648 
SeqID n° 2649 
SeqID n° 2650 



SeqID n° 2651 
SeqID n° 2652 

SeqID n° 2653 
SeqID n° 2654 

SeqID n° 2655 
SeqID n° 2656 

. SeqID n° 2657 
SeqID n° 2658 

SeqID n° 2659 

SeqID n° 2660 

SeqID n° 2661 
SeqID n° 2662 
SeqID n° 2663 

SeqID n° 2664 
SeqID n° 2665 
SeqID n° 2666 
SeqID n° 2667 
SeqID n° 2668 
SeqID n° 2669 



LM-735. 1 From 2878989 to 2879747 
LM-736. 1 From 2879906 to 2880205 
LM-737. 1 From 2880217 to 2881071 



LM-739.1 From 2881243 to 2882049 

LM-74. 1 From 2757606 to 27579 1 1 

LM-740. 1 From 2882033 to 2882938 

LM-741 . 1 From 2882963 to 2883409 

LM-742. 1 From 2883422 to 2884078 

LM-744. 1 From 28841 41 to 2885547 

LM-747. 1 From 2885581 to 2886582 

LM-748.2 From 2886584 to 2887297 

LM-749.2 From 2536347 to 2538581 

LM-750.1 From 2538588 to 2539196 

LM-7511 From 2539213 to 2539611 

LM-752. 1 From 2539838 to 25401 73 

LM-753. 1 From 2540421 to 2541 857 

LM-754.1 From 2542014 to 2542610 

LM-755. 1 From 2542658 to 2544049 

LM-758.1 From 2545487 to 2546503 

LM-759.1 From 2546530 to 2547501 

LM-760. 1 From 2547509 to 2548477 

LM-761 . 1 From 2548479 to 2549354 



Unknown 
Unknown 
Unknown, highly similar to 

B. subtilis DNA-binding 
protein SpoOJ-like homolog 
YyaA 

Unknown, similar to E. coli 
RpiR transcription regulator 
Unknown, similar to 

cellobiose 
phosphotransferase 
enzyme !IB component 
Unknown, similar to 
transcription regulator 
Unknown, similar to 
phosphotransferase 
system mannitol-specific 
enzyme HA 
Unknown, similar to 

phosphatase 
Unknown, similar to 
phosphotransferase 
system mannitol-specific 
enzyme IIBC 
Unknown, similar to 

dehydrogenase 
Unknown, similar to a 

putative N- 
acetylmannosamine-6- 
phosphate epimerase 
unknown, similar to 

transport protein 
Unknown, similar to 
transcription regulator 
Unknown 
Unknown 
Unknown, similar to 
chitinase and chitin binding 
protein 
ATP-dependent Clp 
protease proteolytic subunit 
Unknown, similar to amino 

acid transporter 
Unknwon, similar to NADH 
oxidase 
Unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
unknown, conserved 
hypothetical protein 
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SeqlDn°2670 LM-762.1 From 2549462 to 2551192 



SeqID n° 2671 LM-763.1 From 2551234 to 2552295 



SeqlD n° 2672 
SeqID n° 2673 
SeqlD n° 2674 
SeqID n° 2675 



LM-764.1 
LM-765.1 
LM-766.1 
LM-767.1 



From 2552312 
From 2553414 
From 2554452 
From 2556013 



to 2553295 
to 2554373 
to 2555927 
to 2556510 



SeqID n° 2676 LM-768.1 From 2556514 to 2557167 



SeqID n° 2677 LM-769.1 From 255721 1 



SeqID n° 2678 LM-77.1 From 2758046 



SeqID n° 2679 

SeqID n° 2680 

SeqID n° 2681 
SeqID n° 2682 

SeqID n° 2683 
SeqID n° 2684 



LM-771.1 From 2558130 

LM-772.1 From 2559255 

LM-774.1 From 2559841 

LM-776.1 From 2561120 

LM-778.1 From 2562591 

LM-78.1 From 2759390 



to 2558044 
to 2759353 

to 2559068 

to 2559608 

to 2561052 
to 2562382 

to 2565461 

to 2759692 



SeqID n° 2685 LM-780.1 From 2565469 to 2567445 

SeqID n° 2686 LM-781.1 From 2567956 to 2568603 

SeqID n° 2687 LM-782.1 From 2568625 to 2569011 

SeqID n° 2688 LM-783.1 From 2569117 to 2569422 



SeqID n° 2689 LM-784.1 From 2569690 to 2570349 



SeqID n" 2690 LM-786.1 From 2570362 to 2571141 



SeqID n° 2691 LM-788.1 From 2571156 to 2571971 



SeqID n 6 2692 LM-789.1 From 2571999 to 2572883 



Unknown, similar to 
phosphomannomutase and 

phosphoglucomutase 
Unknown, similar to aldose 
1-epimerase (mutarotase) 
UDP-glucose 4-epimerase 
thioredoxin reductase 

Unknown 
Unknown, similar to 
acetyltransferase 
Unknown, similar to B. 
subtilis P-Ser-HPr 
phosphatase 
unknown, highly similar to 

proiipoprotein 
diacylglyceryl transferase 
Unknown, similar to 

cellobiose 
phosphotransferase 
enzyme IIC component 
HPr-P(Ser) 
kinase/phosphatase 
Unknown, similar to B. 
subtilis YvID protein 

unknown 
Unknown, similar to B. 
subtilis YvlB protein 
excinuclease ABC (subunit 
A) 

Unknown, similar to 

cellobiose 
phosphotransferase 
enzyme HA component 
excinuclease ABC (subunit 
B) 

Unknown 
unknown 
Unknown, similar to 
transcription regulator ArsR 
family 
Unknown, similar to 
negative regulator of 
phosphate regulon 
unknown, similar to 
phosphate ABC transporter 
(ATP-binding protein) 
unknown, similar to 
phosphate ABC transporter 
(ATP-bir:ding protein) 
Unknown, similar to 
phosphate ABC transporter 
(permease protein) 
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SeqID n° 2693 LM-791 .1 From 2572880 to 2573803 



SeqID n° 2694 LM-793. 1 From 2573897 to 2574805 



SeqID n° 2695 

SeqID n° 2696 

SeqID n° 2697 
SeqID n° 2698 



LM-794.1 

LM-795.1 

LM-796.2 
LM-797.2 



From 2575061 

From 2576836 

From 2577696 
From 2578899 



to 2576836 

to 2577546 

to 2578871 
to 2580347 



SeqID n° 2699 LM-799.2 From 2278639 to 2279292 



SeqID n° 2700 
SeqID n° 2701 
SeqID n° 2702 



LM-8.1 
LM-80.1 
LM-801.1 



From 2717422 
From 2760147 
From 2277398 



to 2718234 
to 2760680 
to 2278513 



SeqID n° 2703 LM-803.1 From 2275520 to 2277325 

SeqID n° 2704 LM-804. 1 From 2274961 to 2275221 

SeqID n° 2705 LM-805.1 From 2274127 to 2274750 

SeqID n° 2706 LM-806.1 From 2272403 to 22741 12 

SeqID n° 2707 LM-807.1 From 2271444 to 2272316 



SeqID n° 2708 LM-809. 1 From 2270465 to 2271454 

SeqID n° 2709 LM-81 . 1 From 2760845 to 2761954 

SeqID n° 271 0 LM-81 0.1 From 2269705 to 2270484 

SeqID n° 271 1 LM-812.1 From 2268961 to 2269701 

SeqID n° 2712 LM-813.2 From 2268424 to 2268900 

SeqID n° 2713 LM-81 7.1 From 2259366 to 2259713 

SeqID n° 2714 LM-818.1 From 2258697 to 2259260 

SeqID n° 2715 LM-820.1 From 2257754 to 2258515 

SeqID n° 2716 LM-821.1 From 2256507 to 2257580 



Unknown, similar to 
phosphate ABC transporter 
(permease protein) 
Unknown, similar to 
phosphate ABC transporter 
(binding protein) 
two-component sensor 
histidine kinase 
two-component response 
regulator 
Unknown 
Unknown, similar to 
cardiolipin synthase 
competence negative 
regulator mecA 
Unknown 
unknown 
Unknown, similar to a 
putative competence 
protein from streptococcus 
pneumoniae 
Unknown, similar to 
oligoendopeptidase 
Unknown 
Unknown 
Unknown 
Unknown, similar to 

ferrichrome ABC 
transporter (binding 

protein) 
Unknown, similar to 
ferrichrome ABC 
transporter (permease) 
Unknown, simlilar to cell 
division protein FtsW 
Unknown, similar to 
ferrichrome ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcriptional regulator 
(tetR family) 
Unknown, similar to 

dehydrogenase 
Unknown, similar to 
unknown proteins 
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SeqID n° 2717 


LM-823.1 


From 2255034 


to 2256401 


SealD n° 2718 


LM-826 1 


From OJSfKOOA 




SeqID n° 2719 


LM-827.1 


From 2252001 


to 2253224 


SeqID n" 2720 


LM-83.1 


From 2761951 


to 2763081 


SeqID n° 2721 


LM-830.1 


From 2251050 


to 2251979 


SeqID n 2722 


LM-831.1 


From 2250431 


to 2250820 


SeqID n° 2723 


LM-833.1 


From 2249683 


to 2250300 


SeqID n e 2724 
SeqID n° 2725 


LM-834.1 
LM-835.1 


From 2248946 
From 2247998 


to 2249617 
to 2248693 


SeqID n° 2726 


LM-836.1 


From 2247050 


to 2247928 


SeqID n° 2727 


LM-837.1 


From 2245899 


to 2246975 


SeqID n° 2728 


LM-838.1 


From 2245143 


to 2245883 


SeqID n° 2729 
SeqID n° 2730 


LM-839.1 
LM-840.1 


From 2244419 
From 2243448 


to 2245141 
to 2244416 


SeqID n° 2731 


LM-841.1 


From 2242376 


to 2243425 


SeqID n° 2732 
SeqID n°2733 
SeqID n° 2734 


LM-842.1 
LM-843.1 
LM-845.1 


From 2240068 
From 2239672 
From 2236844 


to 2241969 
to 2240013 

kW WW 1 W 

to 2239135 


SeqID n° 2735 


LM-846.1 


From 2235739 


to 2236788 

tW C Mm WW 1 \J\J 


SeqID n° 2736 


LM-847.1 


From 2235305 


to 2235742 


SeqID n a 2737 


LM-848.1 


From 2234982 


to 2235308 


SeqID n° 2738 


LM-849.1 


From 2234575 


to 2234874 


SeqID n° 2739 


LM-85.1 


From 2763082 


to 2765652 


SeqID n 8 2740 


LM-850.1 


From 2233841 


to 2234158 


SeqID n° 2741 


LM-851.1 


From 2233150 


to 2233791 



PCT/FR01/01118 



Unknown, similar to sigma- 
54-dependent 
transcriptional activator 
Unknown, similar to 
propionate CoA- 
transferase 
Unknown, similar to 
antiporter proteins 
Unknown, simlilar to cell 
division protein FtsW 
Unknown, similar to 
unknown proteins 
Unknown, similar to 

glyoxalase I 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
transcription regulator 

CRP/FNR family 
Unknown, similar to 
transcriptional regulator 
(AraQ/XylS family) 
Unknown, similar to 

oxidoreductase 
Unknown, similar to 
unknown proteins 

Unknown 
Unknown, similar to 
unknown proteins 
Unknown, similar to 
oxidoreductase 
Unknown 
Unknown 
Unknown, similar to 
ribonucleoside-diphosphate 
reductase, subunit alpha 
Unknown, similar to 
ribonucleoside-diphosphate 
reductase, subunit beta 
Unknown, similar to 

flavodoxin 
Unknown, similar to 

thioredoxin 
Unknown, similar to 
unknown proteins 
Unknown, highly similar to 
Mg2+ transport ATPase 
Unknown, similar to 
unknown proteins 
Unknown, . similar to 
unknown proteins 
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SeqlDn°2742 LM-853.1 

SeqlDn°2743 LM-854.1 

SeqlDn°2744 LM-856.1 

SeqlDn°2745 LM-857.1 

SeqlDn°2746 LM-858.1 

SeqlDn°2747 LM-859.1 

SeqlDn°2748 LM-860.1 

SeqIDn°2749 LM-861.1 

SeqlDn°2750 LM-862.1 

SeqlDn°2751 LM-863.1 

SeqlDn°2752 LM-865.1 

SeqlDn°2753 LM-866.1 

SeqlDn°2754 LM-867.1 

SeqlDn°2755 LM-869.1 

SeqlDn°2756 LM-87.1 

SeqlDn°2757 LM-870.1 

SeqlDn°2758 LM-871.1 

SeqlDn°2759 LM-872.1 

SeqID n° 2760 LM-874.2 

SeqlDn°2761 LM-875.1 

SeqID n° 2762 LM-876.1 



169 

From 2232087 to 2233094 

From 2231127 to 2231981 

From 2230042 to 2231 058 

From 2229193 to 2229927 

From 2227412 to 22291 54 

From 2226533 to 22271 98 

From 2225471 to 2225944 

From 2224219 to 2225454 

From 2223324 to 2224226 

From 2221671 to 2223128 

From 2221186 to 2221659 

From 2220862 to 2221 1 73 

From 221 9751 to 2220845 

From 221 8879 to 221 9733 

From 2766207 to 2766773 

From 221 7963 to 221 8862 

From 2217222 to 2217953 

From 2216054 to 2216743 

From 24861 36 to 2486921 

From 2484816 to 2486117 

From 2483589 to 24848 1 5 



Unknown, similar to 
unknown proteins 
Unknown, similar to 
transcription regulator LysR 
family 
Unknown, similar to 

unknown protein 
Unknown, similar to 
transcription regulator 
GntR family 
Unknown, weakly similar to 
mannose-6-phosphate 
isomerase 
Unknown 
Unknown, similar to 
unknown protein 
Unknown; similar to ABC 
transporter (membrane 

protein) 
Unknown, similar to ABC 
transporter (ATP-binding 
protein) 
Unknown, similar to 
transcription regulator 
Unknown, similar to PTS 
system, fructose-specific 
enzyme !IA component 
Unknown, similar to PTS 
system, fructose-specific 
enzyme HB component 
Unknown, similar to PTS 
system, (ructose-specific 
enzyme ilC component 

Unknown, similar to 
fructose- i ,6-biphosphate 
aldolase type II 
Unknown, similar to 
transcription regulator, 
TetR family 
Unknown, similar to 
f ru ctose- 1 , 6-biphosphate 
aldo^se type II 
Unknown 
Unknown 
Unknown, similar to ABC 
transporter, ATP-binding 
protein 
Unknown, similar to 
aminotransferase 
Unknown, similar to 
aminotransferase 
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SeqID n° 2763 LM-877.1 From 2483149 to 2483592 



SeqID n° 2764 LM-878.1 From 2481732 to 2483126 



SeqID n° 2765 
SeqID n° 2766 
SeqID n° 2767 
SeqID n° 2768 

SeqID n° 2769 
SeqID n° 2770 



LM-879.1 From 2481023 to 2481574 

LM-880.1 From 2480531 to 2480944 

LM-881 .1 From 2479607 to 2479921 

LM-883. 1 From 2478623 to 2479465 

LM-884. 1 From 2478225 to 2478590 

LM-885. 1 From 2477385 to 2478224 



SeqID n° 2771 LM-886. 1 From 2475860 to 2477251 

SeqID n° 2772 LM-887. 1 From 2475588 to 2475863 

SeqID n° 2773 LM-888. 1 From 2474801 to 2475568 

SeqID n° 2774 LM-889. 1 From 2474309 to 2474752 

SeqID n° 2775 LM-89. 1 From 2766935 to 2768707 

SeqID n° 2776 LM-890.1 From 2472963 to 2474273 

SeqID n° 2777 LM-892.1 From 2472438 to 2472938 

SeqID n° 2778 LM-893. 1 From 2472089 to 2472325 

SeqID n° 2779 LM-896.1 From 2468210 to 2468392 

SeqID n° 2780 LM-897. 1 From 2467643 to 2467969 

SeqID n° 2781 LM-899.1 From 2466713 to 2467342 



SeqID n° 2782 LM-90.1 From 2768944 to 2769273 
SeqID n° 2783 LM-901.1 From 2465630 to 2466625 



SeqID n° 2784 LM-903.1 From 2463969 to 2465180 
SeqID n° 2785 LM-904. 1 From 2463066 to 2463884 



Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown 
Unknown 
unknown 
unknown, similar to B. 
subtilis YunF protein 

Unknown 
Unknown, similar to 
conserved hypothetical 

proteins 
Unknown, similar to B. 
subtilis YunD protein 
Unknown, similar to B. 
subtilis YutD protein 
Unkncwn, similar to 
conserved hypothetical 
protein and to B. subtilis 
YutF protein 
Unknown, similar to 
acetyliransferase 
Unknown, similar to 

auiolysin, N- 
acetylmuramidase 
Unknown, similar to 
conserved hypothetical 
proteins 
Unknown, low temperature 
requirement C protein, also 
similar to B. subtilis YutG 

protein 
Unknown,, similar to NifU 

protein 
Unknown, hypothetical 
CDS 

Unknown, similar to B. 
subtilis YuzD protein 
Unknown; conserved 
hypothetical protein similar 
to B. subtilis YhfK protein 
Unknown 
Unknown, similar to 
hypothetical thioredoxine 

reductase 
Unknown, similar to NADH 
dehydrogenase 
Unknown, similar to B. 
subtilis YwqG protein 



Li 
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SeqID n° 2786 


LM-906.1 


From 2461803 


to 2463029 


SeqID n° 2787 


LM-907.1 


From 2461219 


to 2461692 


beqlD n 2788 


LM-908.1 


From 2460720 


to 2461091 


SeqID n° 2789 


LM-91.1 


From 2769381 


to 2770007 


SeqID n° 2790 


LM-910.1 


From 2460090 


to 2460683 


SeqID n° 2791 


LM-911.1 


From 2459819 


to 2460106 



SeqID n° 2792 LM-912.1 From 2459343 to 2459822 

SeqID n° 2793 LM-91 3. 1 From 2457852 to 2459336 

SeqID n° 2794 LM-91 5. 1 From 2457512 to 2457859 

SeqID n° 2795 LM-916.1 From 2457087 to 2457512 

SeqID n° 2796 LM-91 7.1 From 2454695 to 2457103 

SeqID n° 2797 LM-91 8.1 From 2453138 to 2454349 

SeqID n° 2798 LM-91 9.1 From 2452322 to 2452906 

SeqID n° 2799 LM-92.1 From 2770060 to 2771439 

SeqID n° 2800 LM-920. 1 From 2451840 to 2452235 



Unknov/n, conserved 
hypothetical protein 
Unknown, similar to B. 
subtilis YuiD protein 
Unknown, similar to B. 
subtilis YuxO protein 
Unknown, similar to 
thymidylate kinase 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance xo cholate and to 
NA(+) and in pH 
homeostasis 
unknown, similar to 
proteins involved in 
resistance to cholate and to 
NA(+) and in pH 
homeostasis 
Unknown, similar to multi- 
drug resistance efflux 

pump 
Unknown, similar to 
peptidyl-prolyl cis-trans 
isomerase 
Unknown, similar to lysine 
decarboxylase 
Unknown 
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SeqID n° 2801 LM-922.1 From 2450437 to 2451798 

SeqID n° 2802 LM-923. 1 From 2449878 to 24501 92 

SeqID n° 2803 LM-924.1 From 2449155 to 2449835 

SeqID n° 2804 LM-925. 1 From 2448052 to 24491 55 

SeqID n° 2805 LM-926.1 From 2446747 to 2447913 

SeqID n° 2806 LM-927. 1 From 2446240 to 2446593 



SeqID n° 2807 LM-929.1 From 2445685 to 2446092 
SeqID n° 2808 LM-930.2 From 2444231 to 2445583 

SeqID n° 2809 LM-931 .2 From 2443368 to 24441 26 



SeqID n° 2810 LM-935.1 From 198201 to 199409 



SeqID n° 281 1 LM-936.1 From 197512 to 198204 

SeqID n° 2812 LM-937.1 From 196786 to 197463 
SeqID n° 2813 LM-938.1 From 195793 to 196611 



SeqID n° 2814 LM-939.1 From 194892 to 195629 

SeqID n° 2815 LM-94.1 From 2771623 to 2772612 

SeqID n° 2816 LM-941.1 From 193989 to 194870 

SeqID n° 2817 LM-942.1 From 193593 to 193850 

SeqID n° 2818 LM-943.1 From 192586 to 193473 

SeqID n° 2819 LM-945.1 From 192018 to 192593 

SeqID n° 2820 LM-946.1 From 190690 to 191916 



Unknown, similar to 

aspartate kinase 
Unknown, similar to 
phosphotransferase 
system (PTS) beta- 
glucoside-specific enzyme 

IIB component 
unknown, similar to ABC- 
transporter ATP binding 
. proteins 
Unknown, similar to 
putative ABC-transporter 
transmembrane subunit 
Unknown, similar to 
aminotransferase 
Unknown, similar to B. 
subtilis general stress 
protein 13 containing a 
riboscmal S1 protein 
domain 
Unknown 
glucose-6-phosphate 

isomerase 
Unknown, similar to 
transc, iption regulator 
DeoR family 
Unknown, similar to ABC 
transporter, ATP-binding 

protein 
ABC'transporter, ATP- 
binding protein 
ijhknwon 
Unknown, similar to PurR, 
transcription repressor of 
purine operon of B. subtilis 
Unknwon, similar to a 
putative phospho-beta- 
glucosidase 
Unknown, similar to 
dihydroxyacetone kinase 
Unknwon, similar to B. 
subtilis YabH protein 
Unknwon, highly similar to 
B. subtilis Veg protein 

dimeihyladenosine 
transferase (16S rRNA 

dimethylase) 
Unkwnon, similar to B. 
subtilis YabF protein 
Unknwon, similar to B. 
subtilis YabE protein 
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SeqlDn°2821 LM-947.1 From 189625 to 190398 

SeqlDn°2822 LM-949.1 From 187867 to 189528 

SeqID n° 2823 LM-95.1 From 2772634 to 2773230 

SeqlDn°2824 LM-950.1 From 185572 to 187863 

SeqID n° 2825 LM-955.1 From 182267 to 185569 





LIVI-aOO. I 


riom louyzo 


to 182184 


SeqID n° 2827 
SeqID n° 2828 


LM-96.1 
LM-960.1 


From 2773234 
From 180052 


to 2773608 
to 180900 


Oci|IL/ II £.0£.V 


uivi-yo i . i 


rrom 1 /yl/4 


to 180052 


SeqID n° 2830 


LM-962.1 


From 177924 


to 179138 


SeqID n° 2831 
SeqID n° 2832 


LM-964.1 
LM-965.1 


From 175766 
From 174834 


to 177760 
to 175694 


SeqID n° 2833 


LM-968.1 


From 172903 


to 173208 


SeqID n° 2834 


LM-969.1 


From 172410 


to 172802 


SeqID n° 2835 


LM-97.1 


From 2773714 


to 2774565 


SeqID n° 2836 


LM-970.1 


From 172072 


to 172410 


SeqID n° 2837 
SeqID n° 2838 


LM-975.1 
LM-977.1 


From 168005 
From 167086 


to 169270 
to 167943 


SeqID n° 2839 


LM-978.1 


From 166687 


to 166971 



SeqID n° 2840 LM-98.1 From 2774684 to 2775523 
SeqID n° 2841 LM-981 .1 From 165761 to 166642 



Unknwon, similar to 
conserved hypothetical 
proteins 
Unknwon, similar to oligo- 
1,6-glucosidase 
Unknown, similar to 
hypothetical 
dihydroxyacetone kinase 
Unknwon, similar to alpha- 

(]!ucosidase 
Unknwon, similar to alpha- 
xylosidase and alpha- 
c giucosidase 
Unknwon, similar to sugar 

ABC transporter, 
periplasms sugar-binding 
protein 
unknown 
Unknwon, similar to sugar 
ABC transporter, permease 

protein 
Unknwon, similar to sugar 
ABO transporters, 
permease proteins 
Unknwon, similar to xylose 

repressor 
methionyi-tRNA synthetase 
Unknwon, similar to 
glucose uptake protein 
Unknown, similar to 

transposase 
Unknown, similar to 
transposase (N-terminal 
. part) 
Unknown, similar to 
putative transcription 

regulator 
Unknown, similar to 
transposase C-terminal 
part 
Unknwon 
Unknown, similar to a 
glucose uptake protein 
Unknown, similar to B. 

subtilis transcription 
regulatory protein AbrB 

Unknown, similar to 
conserved hypothetical 

protein 
Unknwon, conserved 
hypothetical protein 
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SeqID n° 2842 


LM-982.1 


From 165492 


to 165764 


SeqID n" 2843 


LM-983.1 


From 164756 


to 165508 


SeqID n° 2844 


LM-984.1 


From 164309 


to 164698 


SeqID n° 2845 
SeqID n" 2846 


LM-987.1 
LM-988.2 


From 163465 
From 162467 


to 164298 
to 163459 


SeqID n° 2847 


LM-990.2 


From 596580 


to 597620 


SeqID n° 2848 


LM-991.1 


From 595843 


to 596538 


SeqID n° 2849 


LM-992.1 


From 595135 


to 595842 


SeqID n° 2850 


LM-994 1 


From 593528 


to 595006 


SeqID n 8 2851 


LM-995.1 


From 592257 


to 593438 


SeqID n 6 2852 
SeqID n a 2853 


LM-996.1 
LM-997.1 


From 591412 
From 590324 


to 592047 
to 591355 


SeqID n° 2854 


LM-998.1 


From 589406 


to 590248 



unknwon, similar to B. 
subtilis YazA protein 
unknwon, conserved 
hypothetical protein 
Unknwoa similar to B. 
subtilis YabA protein 

Unknwon 
Unknwon, similar to B. 
subtilis DMA polymerase III 
(delta subunit) 
Unknown, conserved 
hypothetical protein 
Unknown, similar to 
phospho£i!ycerate mutase 

Unknown, similar to 
phosphogjycerate mutase 

Unknown, similar to di- 
tripeptide ABC transporter 

(membrane protein) 
Unknown, similar to NADH- 
dependent butanol 
dehydrogenase 
.Unknown 
Unknown, similar to 
unknown protein 
Unknown 
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REVENDICATIONS 

1. Sequence nucleotidique de Listeria monocytogenes caractfrisee en ce qu'elle 
correspond a SEQ ID N° 1 . 
5 2. Sequence nucleotidique de Listeria monocytogenes, caracteriset: en ce qu'elle 
est choisie parmi : 

a) une sequence nucleotidique comportant au moins 80 % d'identite avec 
SEQIDN 0 ! ; 

b) une sequence nucleotidique hybridant dans des conditions de forte 
1 0 stringence avec SEQ ID N° 1 ; 

c) une sequence nucleotidique complementaire de SEQ ID 
N° 1 ou complementaire d'une sequence nucleotidique teile que definie 
en a), ou b), ou une sequence nucleotidique de TARN correspondant ; 

d) une sequence nucleotidique de fragment repr&entatif de SEQ ID N° 1, 
15 ou de fragment repr6sentatif d'une sequence nucleotidique telle que 

d6finie en a), b) ou c); 

e) une sequence nucleotidique comprenant une sequence te!]e que definie 
en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence nucidotidique telle 
20 que d6finie en a), b), c), d) ou e). 

3. Sequence nucleotidique selon la revendication 2, caractdrisee en ce qu'il s'agit 
d'une sequence issue de SEQ ID N° 1 , et en ce qu'elle code pour ivn polypeptide 
choisi parmi les polypeptides de sequence SEQ ID N° 1 k SEQ ID N° 2854, 

4. Sequence nucleotidique caracteris^e en ce qu'elle comprend une sequence 
25 nucleotidique choisie parmi ; 

a) une sequence nucleotidique selon la revendication 3 ; 

b) une sequence nucleotidique comportant au moins 80 % d'identit6 avec 
une sequence nucleotidique selon la revendication 3 ; 

c) une sequence nucleotidique s'hybridant dans des conditions de forte 
30 stringence avec une s6quence nucleotidique selon la revendication 3 ; 

d) une sequence nucleotidique complementaire ou d'ARN correspondant a 
une sequence telle que definie en a), b) ou c) ; 
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e) une sequence nucleotidique de fragment representatif d'une sequence 
telle que definie en a), b), c) ou d) ; et 

f) une sequence nucleotidique modifiee d'une sequence telle que definie en 
a), b), c), d) ou e). 

5 5. Polypeptide code par une sequence nucleotidique selon Tune des revendications 
2a4. 

6. Polypeptide selon la revendication 5, caracterise en ce qu'il est choisi parmi les 
sequences SEQ ID N° 2 a SEQ ID N° 2854. 

7. Polypeptide caracterise en ce qu'il comprend un polypeptide choisi parmi : 
10 a) un polypeptide selon Tune des revendications 5 et 6 ; 

b) un polypeptide presentant au moins 80 % d' identity avec un polypeptide 
selon Tune des revendications 5 et 6 ; 

c) un fragment d'au moins 5 acides amines d'un polypeptide selon Tune 
des revendications 5 et 6, ou tel que defini en b) ; 

15 d) un fragment biologiquement actif d'un polypeptide selon l'une des 

revendications 5 et 6, ou tel que defini en b) ou c) ; et 
e) un polypeptide modifie d'un polypeptide selon Tune des revendications 
5 et 6 ou tel que defini en b), c) ou d). 

8. Sequence nucleotidique codant pour un polypeptide selon la revendication 7. 
20 9. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caract&isee 

en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans la biosynthese des acides amines ou Tun de ses fragments. 

10. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracteris^e 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

25 dans la biosynthese des cofacteurs, groupes prosthetiques et transporters ou 
Tun de ses fragments. 

11. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caract6risde 
en ce qu'elle code pour un polypeptide d'enveloppe cellulaire ou situe a la 
surface de Listeria monocytogenes ou Tun de ses fragments, ledit polypeptide 

30 etant de preference SEQ ID N° 2 a SEQ ID N° 41 . 

12. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caract6ris6e 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes impliqu^ 
dans la machinerie cellulaire ou 1'un de ses fragments. 
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13. Sequence nucleotidique selon Tune des revendications 2 a 4, et 5, caracterisee 
en ce quelle code pour un polypeptide de Listeria monocytogenes implique 
dans Ie metabolisme intermediate central ou l'un de ses fragments. 

14. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
5 en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

dans le metabolisme 6nergenique ou Tun de ses fragments. 

15. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le metabolisme des acides gras et des phospholipides ou Tun de ses 

10 fragments. 

16. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le metabolisme des nucleotides, des purines, des pyrimidines ou 
nucleosides ou l'un de ses fragments. 

15 17. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans les fonctions de regulation ou Tun de ses fragments. 

18. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

20 dans le processus de replication ou Tun de ses fragment. 

19. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8 : caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le processus de transcription ou l'un de ses fragments. 

20. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8 3 caracterisee 
25 en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 

dans le processus de traduction ou Tun de ses fragments. 

21. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans le processus de transport et de liaison des proteines ou l'un de ses 

30 fragments. 

22. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans Tadaptation aux conditions atypiques ou l'un de ses fragments. 
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23. Sequence nucleotidique selon l'une des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans la sensibilite aux medicaments et analogues ou Tun de ses fragments. 

24. Sequence nucleotidique caracterisee en ce qu'elle code pour un polypeptide de 
5 Listeria monocytogenes implique dans la biosynthese de la vitamine B12, ledit 

polypeptide etant de preference SEQ ID N° 42 a SEQ ID N°64. 

25. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracteris6e 
en ce qu'elle code pour un polypeptide de Listeria monocytogenes implique 
dans les fonctions relatives aux transposons ou Tun de ses fragments. 

10 26. Sequence nucleotidique selon Tune des revendications 2 a 4, et 8, caracterisee 
en ce qu'elle code pour un polypeptide specifique de Listeria monocytogenes ou 
Tun de ses fragments. 

27. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la biosynthese des 

1 5 acides amines ou Tun de ses fragments. 

28. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la biosynthese des 
cofacteurs, groupes prosthetiques et transporters ou Tun de ses fragment. 

29. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
20 d'un polypeptide d'enveloppe cellulaire ou situ£ a la surface de Listeria 

monocytogenes ou Tun de ses fragments, ledit polypeptide etant de preference 
SEQ ID N° 2 a SEQ ID N° 41. 

30. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la machinerie 

25 cellulaire ou l'un de ses fragments. 

31. Polypeptide selon Tune des revendications 5 a 7, caracterise en c$ qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le metabolisme 
intermediate central ou l'un de ses fragements. 

32. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
30 d'un polypeptide de Listeria monocytogenes implique dans le metabolisme 

energetique ou Tun de ses fragments. 
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33. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le metabolisme des 
acides gras et des phospholipides ou Tun de ses fragments. 

34. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
5 d'un polypeptide de Listeria monocytogenes implique dans le metabolisme des 

nucleotides, des purines, des pyrimidines ou nucleosides ou Tun de ses 
fragments. 

35. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans les fonctions de 

1 0 regulation ou Tun de ses fragments. 

36. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
replication ou l'un de ses fragments. 

37. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
15 d'un polypeptide de Listeria monocytogenes implique dans le processus de 

transcription ou l'un de ses fragments. 

38. Polypeptide selon 1'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
traduction ou Pun de ses fragments. 

20 39. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans le processus de 
transport et de liaison des proteines ou l'un de ses fragments. 

40. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans Tadaptation aux 

25 conditions atypiques ou l'un de ses fragments. 

41. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes implique dans la sensibilite aux 
medicaments et analogues ou l'un de ses fragments. 

42. Polypeptide selon l'une des revendications 5 a 7, caracterise en ce qu'il s'agit 
30 d'un polypeptide de Listeria monocytogenes implique dans la biosynthese de la 

vitamine B12, ledit polypeptide etant de preference SEQ ID N° 42 a SEQ ID 
N°64. 
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43. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
d'un polypeptide de Listeria monocytogenes impliqu6 dans les fonctions 
relatives aux transposons ou Tun de ses fragments. 

44. Polypeptide selon Tune des revendications 5 a 7, caracterise en ce qu'il s'agit 
5 d'un polypeptide specifique de Listeria monocytogenes ou "un de ses 

fragments. 

45. Sequence nucleotidique selon l'une des revendications 1 a 4, et S a 26 , et/ou 
sequence de polypeptide selon Tune des revendications 5 a 7. et, 27 a 44, 
caracterisee(s) en ce que ladite ou lesdites sequences est (sont) enregistree(s) sur 

10 un support d'enregistrement dont la forme et la nature facilitei;t la lecture, 
l'analyse et/ou Texploitation de ladite ou desdites sequences. 

46. Sequence nucleotidique ou sequence de polypeptide selon la revendication 45, 
caracterisee en ce que le support d'enregistrement est un CD-ROM, une 
disquette informatique ou un serveur informatique. 

15 47. Support d'enregistrement caracterise en ce qu'y est enregistre une sequence 

nucleotidique ou de polypeptide selon la revendication 45 ou 46. 
48. Utilisation d'un support selon la revendication 47 pour le choix d'arnorces ou de 

sondes nucl6otidiques pour la determination de genes dans des souches proches 

de Listeria monocytogenes. 
20 49. Utilisation d'un support selon la revendication 47 pour 1'etude du 

polymorphisme genetique de souches proches de Listeria monocytogenes. 
50. Utilisation d'un support selon la revendication 47, pour 1'etude d'autres 

genomes en particulier pour l'annotation automatique de genes provenant 

d'autres genomes. 

25 51. Sequence nucleotidique utilisable comme amorce ou comme sonde, caracterisee 

en ce que ladite sequence est choisie parmi les sequences nucleotidiques selon 

l'une des revendications 2 a 4, et, 8 a 26. 
52. Sequence nucleotidique selon la revendication 51, caracterisee ea.ce qu'elle est 

marquee par un compose radioactif ou par un compose non radioactif. 
30 53. Sequence nucleotidique selon l'une des revendications 51 et 52, caracterisee en 

ce qu'elle est immobilis^e sur un support, de maniere covalente ou non- 

covalente. 
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54. Sequence nucleotidique selon la revendication 53, caracterisee en ce qu'elle est 
immobilisee sur un support tel qu'un filtre a haute densite ou une p:;ce a ADN. 

55. Sequence nucleotidique selon Tune des revendications 52 a 55 pour la detection 
et/ou l'amplification de sequences nucleiques. o 

5 56. Puce a ADN ou filtre, caracterise en ce qu'il contient au moins une sequence 
nucleotidique selon la revendication 55. 
57. Puce a ADN ou filtre selon la revendication 56, caracterise en ce qu'il contient 
en outre au moins une sequence nucleotidique d'un micro-organisme autre que 
Listeria monocytogenes, immobilisee sur le support de ladite puce. 
10 58. Puce a ADN ou filtre selon la revendication 57, caracterise en ce que le micro- 
organisme autre est choisi parmi un micro-organisme associe a Listeria 
monocytogenes, une bacterie du genre Listeria, et un variant de Listeria 
monocytogenes. 

59. Kit ou necessaire pour la detection et/ou Identification de bacterie* appartenant 
15 a l'espece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

en ce qu'il comprend une puce k ADN ou un filtre selon la revendication 56. 

60. Kit ou necessaire pour la detection et/ou 1' identification d'un micro-organisme, 
caracterise en ce qu'il comprend une puce a ADN ou un filtre selon Tune des 
revendications 57 et 58. 

20 61. Kit ou necessaire pour la detection et/ou la quantification de l'expression d'au 
moins un gene de Listeria monocytogenes, caracterise en ce qu'il comprend une 
puce a ADN ou un filtre selon l'une des revendications 56 a 58. 
62. Vecteur de clonage, et/ou d'expression, caracterise en ce qu'ii contient une 
sequence nucleotidique selon Tune des revendications 2 a 4 et 8 a 26. 

25 63. Vecteur de clonage, et/ou d'expression selon la revendication 62,,csracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 71. 

64. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 14. 

65. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
30 ce qu'il contient une sequence nucleotidique selon la revendication 9, 19 ou 20. 

66. Vecteur de clonage, et/ou d'expression selon la revendication 62, caracterise en 
ce qu'il contient une sequence nucleotidique selon la revendication 12, en 



WO 01/77335 



PCT/FR01/01118 



182 

particulier codant pour une proline impliquee dans les m^canismes de 
secretion. 

67. Vecteur de clonage, et/ou depression selon la revendication 62, caract&ise en 
ce qu'il contient une sequence nucleotidique selon la revendication 24. 
5 68. Vecteur de clonage, et/ou d'expression selon la revendication 62, caract&ise en 
ce qu'il contient une sequence nucleotidique codant pour une proteine impliquee 
dans la resistance et/ou Padaptation au stress. 
69. Cellule hote, caracterisee en ce qu'elle est transformee par un vecteur selon 
Tune des revendications 62 a 68. 
10 70. Cellule hote selon la revendication 69, caracterisee en ce qu'il s'agit d'une 
bacterie appartenant au genre Listeria. 

71. Cellule h6te selon la revendication 70, caracterisee en ce qu'il s'agit d'une 
bacterie appartenant a 1 'espSce Listeria monocytogenes. 

72. Vegetal ou animal, excepte PHomme, comprenant une cellule transformee selon 
1 5 Tune des revendications 69 a 7 1 . 

73. Procede de preparation d'un polypeptide, caracterise en ce que Von cultive une 
cellule transformee par un vecteur selon la revendication 62 dans des conditions 
permettant l'expression dudit polypeptide et que Ton recupere ledit polypeptide 
recombinant. 

20 74. Polypeptide recombinant susceptible d'etre obtenu par un procede selon la 
revendication 73. 

75. Procede de preparation d'un polypeptide synthetique selon l'une des 
revendications 5 a 7, 27 a 44, caracterise en ce que Ton effectue une synthese 
chimique dudit polypeptide. 
25 76. Polypeptide hybride, caracterise en ce qu'il comprend au moins la sequence 
d'un polypeptide selon Tune des revendications 5 a 7, 27 a 44 et 74, et une 
sequence d'un polypeptide susceptible dinduire une reponse immunitaire chez 
l'homme ou P animal. 

77. Sequence nucleotidique codant pour un polypeptide hybrids selon la 
30 revendication 76. 

78. Vecteur caracterise en ce qu'il contient une sequence nucleotidique selon la 
revendication 77. 
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79. Anticorps monoclonal ou polyclonal, ses fragments, ou anticorps chimerique, 
caract6ris6 en ce qu'il est capable de reconnaftre specifiquement un polypeptide 
selon Tune des revendications 5 a 7, 27 a 44, 74 ou 76. 

80. Anticorps selon la revendication 79, caracterise en ce qu'il s'agit d : un anticorps 
5 marqu6. 

8LProced£ pour la detection et/ou ridentification de bacteries appartenant a 
Tespece Listeria monocytogeries ou a un microorganisme associe dans un 
6chantillon biologique, caracterise en ce qu'il comprend les Stapes suivantes : 

a) mise en contact de l'echantillon biologique avec un anticorps selon Tune 
1 0 des revendications 79 et 80; 

b) mise en evidence du complexe antigene-anticorps eventuellement formS. 

82. Precede pour la detection de l'expression d'un gene de Listeria monocytogenes 
caracterise en ce que Ton met en contact une souche de Listeria monocytogenes, 
avec un anticorps selon la revendication 79 ou 80 et que 1'on d&ecte le 

1 5 complexe antigene/anticorps eventuellement form6. 

83. Kit ou n6cessaire pour la mise en oeuvre d'un procede selon la revendication 81 
ou 82, caracterise en ce qu'il comprend les elements suivants : 

a) un anticorps selon Tune des revendications 79 et 80; 

b) eventuellement, les reactifs pour la constitution du milieu propice a la 
20 reaction immunologique ; 

c) Eventuellement, les reactifs permettant la mise en evidence des 
complexes antigene-anticorps produits par la reaction immunologique. 

84. Polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 et 76, ou anticorps 
selon Tune des revendications 79 et 80, caracterise en ce qu'il est immobilise 

25 sur un support, notamment une puce a proteine. 

85. Puce a proteine, caract6risee en ce qu'elle contient au moins un polypeptide 
selon Tune des revendications 5 a 7, 27 a 44, 74 et 76, ou au moins un anticorps 
selon l'une des revendications 79 et 80, immobilise sur le support de ladite 
puce. 

30 86. Puce a proteine selon la revendication 85, caracterisee en ce qu'elk-t contient en 
outre au moins un polypeptide de micro-organisme autre que Listeria 
monocytogenes ou au moins un anticorps dirige contre un compose de micro- 
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organisme autre que Listeria monocytogenes, immobilise sur le support de ladite 
puce. 

87. Kit ou necessaire pour la detection et/ou Identification de bacteries appartenant 
a l'espece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

5 en ce qu'il comprend une puce a proteine selon Tune des revendications 85 et 
86. 

88. Kit ou necessaire pour la detection et/ou l'identification d'un micro-organisme, 
caracterise en ce qu'il comprend une puce a proteine selon la revendication 86. 

89. Procede de detection et/ou d'identification de bacteries appartensnl a l'espece 
1 0 Listeria monocytogenes ou a un micro-organisme associe dans uvi echantillon 

biologique, caracterise en ce qu'il met en oeuvre une sequence nucleotidique 
selon Tune des revendications 1 a 4, 8 a 26, 51 a 55 et 77. 

90. Proc&te selon la revendication 89, caracterise en ce qu'il comporte les Stapes 
suivantes : 

1 5 a) eventuellement, isolement de 1'ADN a partir de l'echantillon biologique a 

analyser, ou obtention d'un ADNc a partir de I'AKN de I' echantillon 
biologique ; 

b) amplification specifique de TADN de bacteries appartenant a l'espece 
Listeria monocytogenes ou a un micro-organisme associe k 1'aide d'au 

20 moins une amorce selon Tune des revendications 51 a 55 ; 

c) mise en Evidence des produits d'amplification. 

91. Procede selon la revendication 89, caracterise en ce qu'il compre-^l les etapes 
suivantes : 

a) mise en contact d'une sonde nucleotidique selon- Tune des 
25 revendications 51 a 55, avec un echantillon biologique, 1'acide nucleique 

contenu dans l'echantillon biologique ayant, le c?.s echeant, 
prealablement ete rendu accessible a l'hybridation, dans des conditions 
permettant l'hybridation de la sonde a 1'acide nucl&que d'une bactfrie 
appartenant a l'espece Listeria monocytogenes ou a un micro-organisme 
30 associe ; 

b) mise en evidence de l'hybride Eventuellement forme entre la sonde 
nucleotidique et 1'acide nucleique de l'echantillon biologique. 
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92. ProcSde selon la revendi cation 89, caracterise en ce qu'il comprerid les etapes 
suivantes : 

a) mise en contact d'une sonde nucleotidique immobilisee sur.un support 
selon la revendication 53 avec un echantillon biologique, l'acide 

5 nucleique de Tdchantillon ayant, le cas echeant, eteprealaLblement rendu 

accessible a l 9 hybridation, dans des conditions permettant "hybridation 
de la sonde a l'acide nucleique d'une bacterie appartenant a l'espece 
Listeria monocytogenes ou a un micro-organisme associe ; 

b) mise en contact de 1' hybride forme entre la sonde nucleotidique 
10 immobilisee sur un support et l'acide nucleique contenu dans 

1 'echantillon biologique, le cas echeant apres elimination de l'acide 
nucleique de I'echantillon biologique n'ayant pas hybride avec la sonde, 
avec une sonde nucleotidique marquee selon la revendication 52 ; 

c) mise en evidence du nouvel hybride forme a l'etape b). 

15 93. Proc&ie selon la revendication 92, caracterise en ce que, prealablement a l'etape 
a), l'ADN de I'echantillon biologique ou l'ADNc obtenu eventuellement par 
transcription inverse de l'ARN de I'echantillon, est amplifie a Taide. d'au moins 
une amorce selon Tune des revendications 51 a 55. 

94. Kit ou necessaire pour la detection et/ou 1'identification de bacteries 
20 appartenant a l'espece Listeria monocytogenes ou a un micro-organisme 

associe, caracterise en ce qu'il comprend les elements suivants : 

a) une sonde nucleotidique selon Tune des revendications 51 a 3 5; 

b) eventuellement, les reactifs necessaires a la mise en csuvre d'une 
reaction d'hybridation ; 

25 c) eventuellement, au moins une amorce selon Tune des revendications 51 

a 55 ainsi que les reactifs necessaires a une reaction d'amplification de 
l'ADN. 

95. Kit ou necessaire pour la detection et/ou 1'identification de bacteries appartenant 
a l'espece Listeria monocytogenes ou a un micro-organisme associe, caracterise 

30 en ce qu'il comprend les elements suivants : 

a) une sonde nucleotidique, dite sonde de capture, selon la 
revendication 53 ; 



WO 01/77335 



PCT/FR01/01118 



186 

b) une sonde oligonucleotidique, dite sonde de revelation, selon la 
revendication 52; 

c) eventuellement, au moins une amorce selon Tune des 
revendications 51 a 55 ainsi que les reactifs nececsaires a une 

5 reaction d'amplification de PADN. 

96. Kit ou necessaire pour la detection et/ou Identification de batteries appartenant 
a Pespece Listeria monocytogenes ou a un micro-organisme associd, caracterise 
en ce qu'il comprend les elements suivants : 

a) au moins une amorce selon Tune des revendications 5 1 a 55; 
10 b) eventuellement, les reactifs necessaires pour effectuer une reaction 

d'amplification d' ADN ; 
c) eventuellement, un composant permettant de verifier la sequence du 
fragment amplifie, plus particulierement une sonde oligonucleotidique 
selon Tune des revendications 51 a 55. 
1 5 97. Precede selon les revendications 81, 82 et 89 a 93 ou kit ou necessaire selon les 
revendications 83, 87, 88 et 94 a 96 pour la detection et/ou P identification de 
batteries appartenant a Pespece Listeria monocytogenes, caracteriG^ en ce que 
ladite amorce et/ou ladite sonde sont choisies parmi les sequences 
nucleotidiques selon Pune des revendications 2 a 4, 8 a 26, 51 a 55 et 77 
20 sp6cifiques de Pespece Listeria monocytogenes, en ce que lesdits polypeptides 
sont choisis parmi les polypeptides selon Pune des revendications 5 a 7, 27 a 44 
et 74 ou 76 specifiques de Pespece Listeria monocytogenes et en os que lesdits 
anticorps sont choisis parmi les anticorps selon Pune des revendications 79 et 80 
diriges contre les polypeptides choisis parmi les polypeptides selon Pune des 
25 revendications 5 a 7, 26 a 44, 74 ou 76 specifiques de Pespece Listeria 
monocytogenes. 

98. Procede ou kit ou necessaire selon la revendication 97, caracteriso en ce que 
ladite amorce et/ou ladite sonde sont choisies parmi les sequences 
nucleotidiques codant pour une proteine secretee, en ce que lesdits polypeptides 
30 sont choisis parmi les polypeptides secretes et en ce que lesdits anticorps sont 
choisis parmi les anticorps selon Pune des revendications 79 et 80 diriges contre 
des polypeptides secretes ou de surface de Listeria monocytogenes, lesdits 
polypeptide etant de preference SEQ ID N° 2 a SEQ ID N°41 . 
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99. Souche de Listeria monocytogenes, caracterisee en ce qu'elle contient au moins 
une mutation dans au moins une sequence nucleotidique seloii Tune des 
revendications 2 a 4 ou 8 a 26. 

100. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
5 ce qu'elle contient une mutation dans une sequence nucleotidique selon la 

revendication 13. 

101. Souche de Listeria monocytogenes selon la revendication 99, cai acterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique selon la 
revendication 14. 

10 1 02. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique selon la 
revendication 9, 19 ou 20. 

103. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotides selon la 

15 revendication 12, en particulier codant pour une proteine impliquee dans les 
mecanismes de secretion. 

104. Souche de Listeria monocytogenes selon la revendication 99, c^cterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique codant pour 
une proteine impliquee dans la resistance et/ou I'adaptation au stress 

20 105. Souche de Listeria monocytogenes selon la revendication 99, caracterisee en 
ce qu'elle contient une mutation dans une sequence nucleotidique codant pour 
une proteine impliquee dans la biosynthese de la vitamine B12. 
106. Souche de Listeria monocytogenes selon 1'une des revendications 99 a 105, 
caracterisee en ce que la mutation mene a une inactivation du gene. . 

25 107. Souche de Listeria monocytogenes selon 1'une des revendications 99 a 105, 
caracterisee en ce que la mutation mene a une surexpression du gene. 

108. Utilisation d'un polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 
et 76 d'une cellule transformee selon 1'une des revendications 69 a 71 d'une 
souche selon la revendication 99 a 107 et/ou d'un animal selon la revendication 

30 72, pour la biosynthese ou la biod^gradation d'uri compose d'interet. 

109. Procede de biosynthese ou de biodegradation d'un compose d'interet, 
caracterise en ce qu'il met en oeuvre un polypeptide selon Tune des 
revendications 5 a 7, 26 a 44, 74 et 76, une cellule transformee selon 1'une des 
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revendications 69 a 71 une souche selon Tune des revendications 99 a 107 et/ou 
un animal selon la revendication 72. 
1 10. Utilisation selon la revendication 108 ou proc&Ie selon la revendication 109, 
caracterises en ce que le compose d'interet est la vitamine B12. 
5 111. Utilisation d'une sequence nucleotidique selon Tune des revendications 2 a 
4, et, 8 a 26, d'un polypeptide selon Tune des revendications 5 a 7, 27 a 44, 74 
et 76, d'un anticorps selon Tune des revendications 79 et 80, d'une cellule 
selon Tune des revendications 69 a 71, et/ou d'un animal transforme selon la 
revendication 72 pour la selection de compose organique cu inorganique 

10 capable de moduler, de reguler, d'induire ou d'inhiber I'expression de genes, 
et/ou de modifier la replication cellulaire de cellules eucaryotes ou procaryotes 
ou capables d'induire, d'inhiber ou d'aggraver chez un organisms animal ou 
humain les pathologies liees a une infection par Listeria monocytogenes ou par 
un micro-organisme associe. 

15 112. Methode de selection de compose capable de se Her a un polypeptide selon 
1'une des revendications 5 a 7, 27 a 44, 74 et 76, capable de se lier a une 
sequence nucleotidique selon l'une des revendications 2 a 4, et, 3 a 26, ou 
capable de reconnaitre un anticorps selon Tune des revendication:, 79 et 80, 
et/ou capable de moduler, de reguler, d'induire ou d'inhiber i'expression de 

20 genes, et/ou de modifier la replication cellulaire de cellules eucaryotes ou 
procaryotes, ou capables d'induire, d'inhiber ou d'aggraver chez un organisme 
animal ou humain les pathologies liees a une infection pur Listeria 
monocytogenes, caracteris^e en ce qu'elle comprend les etapes suivantes : 

a) mise en contact dudit compose avec ledit polypeptide, ladito: sequence 
25 nucleotidique, avec une cellule transformee selon Fune des 

revendications 69 a 71, et/ou administration dudit compose k un animal 
transforme selon la revendication 72; 

b) determination de la capacite dudit compose a se lier avec ledit 
polypeptide ou ladite sequence nucleotidique, ou de moduler, de reguler, 

30 d'induire ou d'inhiber I'expression de genes, ou de rnoduler la 

croissance ou la r6plication cellulaire, ou d'induire, d'inhiber ou 
d'aggraver chez ledit organisme animal ou humain les pathologies ltees 
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a une infection par Listeria monocytogenes ou par un micro-organisme 
associe. 

113. Compose susceptible d'etre s&ectionne par une methode selon la 
revendication 112. 

5 114. Composition pharmaceutique comprenant un compose choisi parmi les 
composes suivants : 

a) une sequence nucleotidique selon Tune des revendications 2 a 4, et 8 a 
26; 

b) un polypeptide selon l'une des revendications 5 a 7, 27 a 44, 74 et 76 ; 
10 c) un vecteur selon Tune des revendications 62 a 68, et 78 ; 

d) un anticorps selon la revendication 79 ou 80 ; et 

e) un compose selon la revendication 113. 

115. Composition selon la revendication 1 14, eventuellement en association avec 
un vehicule pharmaceutiquement acceptable. 
15 116. Composition pharmaceutique selon Tune des revendications 114 et 115 pour 
la prevention ou le traitement d'une infection par une bacterie appartenant a 
1'espece Listeria monocytogenes ou par un micro-organisme associe. 

117. Composition vaccinale, caracterisee en ce qu'elle comprend un gu plusieurs 
polypeptides selon Tune des revendications 5 a 7, 27 a 44, 74, et/ou un ou 

20 plusieurs polypeptides hybrides selon la revendication 76. 

118. Utilisation d'une cellule selon Tune des revendications 62 a 68, pour la 
preparation d'une composition vaccinale. 

1 19. Composition vaccinale, caracterisee en ce qu'elle contient un vecteur selon 
l'une des revendications 62 a 68 et 78, et/ou une cellule selon Tune des 

25 revendications 69 a 7 1 . 

120. Composition vaccinale capable d'induire une reponse immuriteire cellulaire 
ou humorale pour la prevention ou Ie traitement d'une infection par bacterie 
appartenant a 1'espece Listeria monocytogenes ou par un micro-organisme 
associe, caracterisee en ce qu'elle comprend une composition immunogene 

30 selon l'une des revendications 117 et 119, en association avec an vehicule 
pharmaceutiquement acceptable et, eventuellement un ou plusieurs adjuvants de 
l'immunite appropries. 
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121. Banque d'ADN genomique d'une bacterie du genre Listeria, caracterisee en 
ce que ladite banque d'ADN est clonee dans des chromosomes artificiels de 
bacteries (BAC). 

122. Banque d'ADN genomique selon la revendication 121, caracterisee en ce 
5 que ladite bacterie est Listeria monocytogenes. 

123. Banque d'ADN genomique selon la revendication 121, car*cterisee en ce 
que ladite bacterie est la souche EGD-e de Listeria monocytogenes. 

124. Banque selon la revendication 122, caracterisee en ce qu'tl s'agit de la 
banque LM-baclib depos6e a la CNCM le 11 Avril 2000, soug le numero 

10 d'ordre 1-2439. 

125. Methode pour Pisolement d'un polynucleotide d'interet present chez une 
souche de Listeria et absente chez une autre souche, caracterise en ce quelle 
utilise au moins une banque selon Tune des revendications 121 ou 124. 

126. Methode selon la revendication 125, caracterisee en ce qu'elle comprend les 
1 5 etapes suivantes : 

a) isoler au moins un polynucleotide contenu dans un clone- -de la banque 
d'ADN basee sur un BAC, d'origine de Listeria, 

b) isoler : 

- au moins un polynucleotide genomique ou ADNc d'une Listeria, ladite 
20 Listeria appartenant a une souche differente de la souche utilisee pour la 

construction de la banque d'ADN BAC de 1'etape a) ou, de fa9on 
alternative, 

- au moins un polynucleotide contenu dans un clone d'une banque d'ADN 
basee sur un BAC prepare a partir du genome d'une Listeria qui est 

25 differente de la Listeria utilisee pour la construction de la banque 

d'ADN basee sur le BAC de 1'etape a). 

c) hybrider le polynucleotide de 1'etape a) au polynucleotide de 1'etape b) ; 

d) s&ectionner les polynucleotides de 1'etape a) qui n'ont pas form6 de 
complexe d'hybridation avec les polynucleotides de l'etaps b) ; 

30 e) caracteriser le polynucleotide selectionne. 

127. Banque genomique d'une bacterie du genre Listeria. 

128. Banque genomique selon la revendication 127, caracterisee en ce que la 
bacterie est Listeria monocytogenes. 



